Abstract. We develop a statistical mechanical interpretation of algorithmic information
theory by introducing the notion of thermodynamic quantities, such as free energy,
energy, statistical mechanical entropy, and specific heat, into algorithmic information
theory. We investigate the properties of these quantities by means of program-size
complexity from the point of view of algorithmic randomness. It is then discovered that, in
the interpretation, the temperature plays a role as the compression rate of the values
of all these thermodynamic quantities, which include the temperature itself. Reflecting
this self-referential nature of the compression rate of the temperature, we obtain fixed
point theorems on compression rate.
1 Introduction

Algorithmic information theory is a framework to apply information-theoretic and probabilistic
ideas to recursive function theory. One of the primary concepts of algorithmic information theory
is the program-size complexity (or Kolmogorov complexity) $H(s)$ of a finite binary string $s$, which
is defined as the length of the shortest binary program for the universal self-delimiting Turing
machine $U$ to output $s$. By the definition, $H(s)$ can be thought of as the information content of the
individual finite binary string $s$. In fact, algorithmic information theory has precisely the formal
properties of classical information theory (see [3]). The concept of program-size complexity plays
a crucial role in characterizing the randomness of a finite or infinite binary string. In [4] Chaitin
introduced the halting probability $\Omega$ as an example of a random infinite string. His $\Omega$ is defined as
the probability that the universal self-delimiting Turing machine $U$ halts, and plays a central role
in the development of algorithmic information theory. The first $n$ bits of the base-two expansion
of $\Omega$ solve the halting problem for a program of size not greater than $n$. By this property, the
base-two expansion of $\Omega$ is shown to be a random infinite binary string. In [5] Chaitin encoded
this random property of $\Omega$ onto an exponential Diophantine equation in the manner that a certain
property of the set of the solutions of the equation is indistinguishable from coin tosses. Moreover,
based on this random property of the equation, Chaitin derived several quantitative versions of
Gödel's incompleteness theorems.

In [15, 16] we generalized Chaitin's halting probability $\Omega$ to $\Omega^D$ by

$$\Omega^D = \sum_{p \in \mathrm{dom}\,U} 2^{-\frac{|p|}{D}}, \tag{1}$$

so that the degree of randomness of $\Omega^D$ can be controlled by a real number $D$ with $0 < D \le 1$. Here,
$\mathrm{dom}\,U$ denotes the set of all programs $p$ for $U$. As $D$ becomes larger, the degree of randomness of
$\Omega^D$ increases. When $D = 1$, $\Omega^D$ becomes a random real number, i.e., $\Omega^1 = \Omega$. The properties of
$\Omega^D$ and its relations to self-similar sets were studied in [15, 16].

Recently, Calude and Stay [3] pointed out a formal correspondence between $\Omega^D$ and a partition
function in statistical mechanics. In statistical mechanics, the partition function $Z(T)$ at
temperature $T$ is defined by

$$Z(T) = \sum_{x \in X} e^{-\frac{E_x}{kT}},$$

where $X$ is a complete set of energy eigenstates of a statistical mechanical system and $E_x$ is the
energy of an energy eigenstate $x$. The constant $k$ is called the Boltzmann Constant. The partition
function $Z(T)$ is of particular importance in equilibrium statistical mechanics. This is because all
the thermodynamic quantities of the system can be expressed by using the partition function $Z(T)$,
and the knowledge of $Z(T)$ is sufficient to understand all the macroscopic properties of the system.
Calude and Stay [3] pointed out that the partition function $Z(T)$ has the same form as $\Omega^D$ by
performing the following replacements in $Z(T)$:

Replacements 1.1.

(i) Replace the complete set $X$ of energy eigenstates $x$ by the set $\mathrm{dom}\,U$ of all programs $p$ for $U$.
(ii) Replace the energy $E_x$ of an energy eigenstate $x$ by the length $|p|$ of a program $p$.
(iii) Set the Boltzmann Constant $k$ to $1/\ln 2$, where $\ln$ denotes the natural logarithm.

In this paper, inspired by their suggestion, we develop a statistical mechanical interpretation of
algorithmic information theory, where $\Omega^D$ appears as a partition function.
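
As a quick numerical illustration of Replacements 1.1, the following Python sketch checks that the Boltzmann factor with $E_x = |p|$ and $k = 1/\ln 2$ coincides with the weight $2^{-|p|/T}$. The three-element set `TOY_DOMAIN` and all function names are hypothetical stand-ins chosen for illustration; the actual set $\mathrm{dom}\,U$ is infinite and cannot be listed in this way.

```python
import math

# Hypothetical finite prefix-free set standing in for dom U (illustration only).
TOY_DOMAIN = ["0", "10", "110"]


def boltzmann_weight(energy, temperature, k=1.0 / math.log(2)):
    """Statistical mechanical weight e^(-E/(kT)), with k = 1/ln 2 by default."""
    return math.exp(-energy / (k * temperature))


def program_weight(program, temperature):
    """Weight 2^(-|p|/T) obtained after applying Replacements 1.1."""
    return 2.0 ** (-len(program) / temperature)


def partition_function(programs, temperature):
    """Sum of the program weights over a finite set of programs."""
    return sum(program_weight(p, temperature) for p in programs)
```

With these definitions the two weights agree up to floating-point rounding, and at temperature 1 the toy sum is the sum of $2^{-|p|}$ over the toy domain.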

Generally speaking, in order to give a statistical mechanical interpretation to a framework which 
looks unrelated to statistical mechanics at first glance, it is important to identify a microcanonical 
ensemble in the framework. Once we can do so, we can easily develop an equilibrium statisti- 
cal mechanics on the framework according to the theoretical development of normal equilibrium 
statistical mechanics. Here, the microcanonical ensemble is a certain sort of uniform probability 
distribution. In fact, in the work [17] we developed a statistical mechanical interpretation of the
noiseless source coding scheme in information theory by identifying a microcanonical ensemble in
the scheme. Then, in [17] the notions in statistical mechanics such as statistical mechanical entropy,
temperature, and thermal equilibrium are translated into the context of noiseless source coding.

Thus, in order to develop a statistical mechanical interpretation of algorithmic information 
theory, it is appropriate to identify a microcanonical ensemble in the framework of the theory. 
Note, however, that algorithmic information theory is not a physical theory but a purely mathe- 
matical theory. Therefore, in order to obtain significant results for the development of algorithmic 
information theory itself, we have to develop a statistical mechanical interpretation of algorithmic 
information theory in a mathematically rigorous manner, unlike in normal statistical mechanics in 
physics where arguments are not necessarily mathematically rigorous. A fully rigorous mathematical
treatment of statistical mechanics is already developed (see [14]). At present, however, it does
not yet seem to be an easy task to merge algorithmic information theory with this mathematical
treatment in a satisfactory manner.
On the other hand, if we do not stick to the mathematical strictness of an argument, we can
develop a statistical mechanical interpretation of algorithmic information theory while realizing a
perfect correspondence to normal statistical mechanics. In fact, in the last part of this paper (i.e.,
in Section 6) we develop a statistical mechanical interpretation of algorithmic information theory
by making an argument on the same level of mathematical strictness as statistical mechanics in
physics. There, we identify a microcanonical ensemble in algorithmic information theory in a
similar manner to [17], based on the probability measure which actually gives Chaitin's $\Omega$ the
meaning of the halting probability. In consequence, for example, the statistical mechanical meaning
of $\Omega^D$ is clarified.

In the main part of this paper, for mathematical strictness we develop a statistical mechanical
interpretation of algorithmic information theory in a different way from the physical argument in
Section 6.^1 We introduce the notion of thermodynamic quantities into algorithmic information
theory based on Replacements 1.1 above. Section 6 plays a role in clarifying the statistical mechanical
meaning of these notions and motivating the introduction of them in the main part of this paper.

After the preliminary section on the mathematical notions needed in this paper, we prove some
results on the degree of randomness of real numbers in Section 3. These results themselves and the
techniques used in proving these results are frequently used throughout this paper.

Then, in Section 4 we introduce the notion of the thermodynamic quantities at any given
fixed temperature $T$, such as partition function, free energy, energy, statistical mechanical entropy,
and specific heat, into algorithmic information theory by performing Replacements 1.1 for the
corresponding thermodynamic quantities in statistical mechanics. These thermodynamic quantities
in algorithmic information theory are real numbers which depend only on the temperature $T$. We
prove that if the temperature $T$ is a computable real number with $0 < T < 1$ then, for each of these
thermodynamic quantities, the compression rate by the program-size complexity $H$ is equal to $T$.
Thus, the temperature $T$ plays a role as the compression rate of the thermodynamic quantities in
this statistical mechanical interpretation of algorithmic information theory.

Among all thermodynamic quantities in thermodynamics, one of the most typical is the
temperature itself. Thus, based on the results of Section 4, the following
question naturally arises: Can the compression rate of the temperature $T$ be equal to the
temperature itself in the statistical mechanical interpretation of algorithmic information theory? This
question is rather self-referential. However, in Section 5 we answer it affirmatively by proving
Theorem 5.1. One consequence of Theorem 5.1 has the following form: For every $T \in (0,1)$, if
$Z(T) = \sum_{p \in \mathrm{dom}\,U} 2^{-\frac{|p|}{T}}$ is a computable real number, then

$$\lim_{n \to \infty} \frac{H(T_n)}{n} = T,$$

where $T_n$ is the first $n$ bits of the base-two expansion of $T$. This is just a fixed point theorem on
compression rate, which reflects the self-referential nature of the question.

The works [15, 16] on $\Omega^D$ might be regarded as an elaboration of the technique used by Chaitin
[4] to prove that $\Omega$ is random. The mathematical results of this paper, that is, all results except
those in Section 6, may be regarded as further elaborations of the technique.

Finally, in Section 6, based on a physical and informal argument we develop a total statistical
mechanical interpretation of algorithmic information theory which attains a perfect correspondence
to normal statistical mechanics. In consequence, we justify the interpretation of $\Omega^D$ as a partition
function and clarify the statistical mechanical meaning of the thermodynamic quantities introduced
into algorithmic information theory in Section 4.

^1 We make an argument in a fully mathematically rigorous manner in this paper except for Section 6. Any
consequence of the argument in Section 6 is not used in any other parts of this paper.

2 Preliminaries 

We start with some notation about numbers and strings which will be used in this paper.
$\mathbb{N} = \{0,1,2,3,\dots\}$ is the set of natural numbers, and $\mathbb{N}^+$ is the set of positive integers.
$\mathbb{Z}$ is the set of integers, and $\mathbb{Q}$ is the set of rational numbers. $\mathbb{R}$ is the set of real numbers.
$\{0,1\}^* = \{\lambda, 0, 1, 00, 01, 10, 11, 000, 001, 010, \dots\}$ is the set of finite binary strings where
$\lambda$ denotes the empty string. For any $s \in \{0,1\}^*$, $|s|$ is the length of $s$. A subset $S$ of $\{0,1\}^*$ is
called a prefix-free set if no string in $S$ is a prefix of another string in $S$. $\{0,1\}^\infty$ is the set of
infinite binary strings, where an infinite binary string is infinite to the right but finite to the left.
For any $\alpha \in \{0,1\}^\infty$, $\alpha_n$ is the prefix of $\alpha$ of length $n$. For any partial function $f$, the domain of
definition of $f$ is denoted by $\mathrm{dom}\,f$. We write "r.e." instead of "recursively enumerable."
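
The prefix-free condition can be made concrete with a small Python sketch. The function name `is_prefix_free` is an assumption made for illustration, and the check applies only to finite sets of strings.

```python
def is_prefix_free(strings):
    """Return True if no string in the finite collection is a proper prefix of another.

    After lexicographic sorting, a string that is a prefix of some other
    string is also a prefix of its immediate successor, so it suffices
    to compare neighbours.
    """
    ordered = sorted(set(strings))
    return not any(b.startswith(a) for a, b in zip(ordered, ordered[1:]))
```

For example, the set {0, 10, 110} passes the check, while {0, 01} does not because 0 is a prefix of 01. Note that the empty string is a prefix of every string, so a prefix-free set containing it has no other element.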

Normally, $o(n)$ denotes any one function $f \colon \mathbb{N}^+ \to \mathbb{R}$ such that $\lim_{n \to \infty} f(n)/n = 0$. On the
other hand, $O(1)$ denotes any one function $g \colon \mathbb{N}^+ \to \mathbb{R}$ such that there is $C \in \mathbb{R}$ with the property
that $|g(n)| \le C$ for all $n \in \mathbb{N}^+$.

Let $T$ be an arbitrary real number. $T \bmod 1$ denotes $T - \lfloor T \rfloor$, where $\lfloor T \rfloor$ is the greatest integer
less than or equal to $T$, and $T \bmod' 1$ denotes $T - \lceil T \rceil + 1$, where $\lceil T \rceil$ is the smallest integer greater
than or equal to $T$. Hence, $T \bmod 1 \in [0,1)$ but $T \bmod' 1 \in (0,1]$. We identify a real number $T$
with the infinite binary string $\alpha$ such that $0.\alpha$ is the base-two expansion of $T \bmod 1$ with infinitely
many zeros. Thus, $T_n$ denotes the first $n$ bits of the base-two expansion of $T \bmod 1$ with infinitely
many zeros.
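
The two reductions and the truncation $T_n$ can be sketched in Python for rational inputs. This is an illustrative sketch only: the function names are assumptions, and exact arithmetic via `fractions.Fraction` is used so that the expansion with infinitely many zeros is the one produced.

```python
from fractions import Fraction
import math


def mod1(t):
    """T mod 1 = T - floor(T), a value in [0, 1)."""
    return t - math.floor(t)


def mod1_prime(t):
    """T mod' 1 = T - ceil(T) + 1, a value in (0, 1]."""
    return t - math.ceil(t) + 1


def first_bits(t, n):
    """First n bits of the base-two expansion of T mod 1 (the one with infinitely many zeros)."""
    x = mod1(Fraction(t))
    bits = []
    for _ in range(n):
        x *= 2              # x is now in [0, 2)
        bit = int(x >= 1)   # next binary digit
        bits.append(str(bit))
        x -= bit
    return "".join(bits)
```

The two reductions differ exactly at the integers: for $T = 3$, `mod1` gives 0 while `mod1_prime` gives 1.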

We say that a real number $T$ is computable if there exists a total recursive function $f \colon \mathbb{N}^+ \to \mathbb{Q}$
such that $|T - f(n)| < 1/n$ for all $n \in \mathbb{N}^+$. We say that $T$ is right-computable if there exists a total
recursive function $g \colon \mathbb{N}^+ \to \mathbb{Q}$ such that $T \le g(n)$ for all $n \in \mathbb{N}^+$ and $\lim_{n \to \infty} g(n) = T$. We say
that $T$ is left-computable if $-T$ is right-computable. It is then easy to see that, for any $T \in \mathbb{R}$, $T$ is
computable if and only if T is both right-computable and left-computable. See e.g. |1H [T9] for the 
detail of the treatment of the computability of real numbers and real functions on a discrete set. 
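
As a concrete instance of these definitions, the following Python sketch gives rational approximations of $\sqrt{2}$. The choice of $\sqrt{2}$ and the function names are assumptions made for illustration; `sqrt2_approx` plays the role of $f$ in the definition of computability, and `sqrt2_upper` plays the role of $g$ in the definition of right-computability.

```python
from fractions import Fraction
from math import isqrt


def sqrt2_approx(n):
    """Rational f(n) with 0 <= sqrt(2) - f(n) < 1/n, for n >= 1."""
    return Fraction(isqrt(2 * n * n), n)


def sqrt2_upper(n):
    """Rational g(n) with sqrt(2) < g(n) and g(n) -> sqrt(2) as n grows."""
    return Fraction(isqrt(2 * n * n) + 1, n)
```

Since $f(n) \le \sqrt{2} < g(n)$ and $g(n) - f(n) = 1/n$, both approximation conditions hold; the same number is therefore both right-computable and, by a symmetric argument, left-computable.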

2.1 Algorithmic information theory 

In the following we concisely review some definitions and results of algorithmic information
theory [4, 6]. A computer is a partial recursive function $C \colon \{0,1\}^* \to \{0,1\}^*$ such that $\mathrm{dom}\,C$
is a prefix-free set. For each computer $C$ and each $s \in \{0,1\}^*$, $H_C(s)$ is defined by
$H_C(s) = \min \{\, |p| \mid p \in \{0,1\}^* \ \&\ C(p) = s \,\}$. A computer $U$ is said to be optimal if for each computer $C$
there exists a constant $\mathrm{sim}(C)$ with the following property: if $C(p)$ is defined, then there is a $p'$
for which $U(p') = C(p)$ and $|p'| \le |p| + \mathrm{sim}(C)$. It is then shown that there exists an optimal
computer. We choose any one optimal computer $U$ as the standard one for use, and define $H(s)$
as $H_U(s)$, which is referred to as the program-size complexity of $s$, the information content of $s$, or
the Kolmogorov complexity of $s$ [9, 10, 4]. Thus, $H(s)$ has the following property:

$$\forall\, C \colon \text{computer} \quad H(s) \le H_C(s) + \mathrm{sim}(C). \tag{2}$$
It can be shown that there is $c \in \mathbb{N}$ such that, for any $s \ne \lambda$,

$$H(s) \le |s| + 2\log_2 |s| + c. \tag{3}$$

For each $s \in \{0,1\}^*$, $P(s)$ is defined as $\sum_{U(p)=s} 2^{-|p|}$. Chaitin's halting probability $\Omega$ is defined
by

$$\Omega = \sum_{p \in \mathrm{dom}\,U} 2^{-|p|}.$$
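
The definitions of $H_C(s)$, $P(s)$, and the halting probability can be illustrated on a finite toy computer. In the Python sketch below, `TOY_COMPUTER` is a hypothetical lookup table with a prefix-free domain, not an optimal computer; for the real $U$ none of these quantities is computable, so the sketch only shows how the definitions fit together.

```python
from fractions import Fraction

# Hypothetical finite "computer": program -> output, with a prefix-free domain.
TOY_COMPUTER = {"0": "a", "10": "b", "110": "a"}


def program_size_complexity(computer, s):
    """H_C(s): length of a shortest program p with C(p) = s, or None if no such p exists."""
    lengths = [len(p) for p, out in computer.items() if out == s]
    return min(lengths) if lengths else None


def output_probability(computer, s):
    """P(s): sum of 2^(-|p|) over all programs p with C(p) = s."""
    terms = (Fraction(1, 2 ** len(p)) for p, out in computer.items() if out == s)
    return sum(terms, Fraction(0))


def halting_probability(computer):
    """Sum of 2^(-|p|) over the whole (finite) domain of the toy computer."""
    return sum((Fraction(1, 2 ** len(p)) for p in computer), Fraction(0))
```

In this toy table the output `a` has two programs, so $P(a) = 1/2 + 1/8$ while $H_C(a) = 1$, and summing $P(s)$ over all outputs recovers the halting probability of the table.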

For any $\alpha \in \{0,1\}^\infty$, we say that $\alpha$ is weakly Chaitin random if there exists $c \in \mathbb{N}$ such that
$n - c \le H(\alpha_n)$ for all $n \in \mathbb{N}^+$ [4, 6]. Then [4] showed that $\Omega$ is weakly Chaitin random. For any
$\alpha \in \{0,1\}^\infty$, we say that $\alpha$ is Chaitin random if $\lim_{n \to \infty} H(\alpha_n) - n = \infty$ [4, 6]. It is then shown
that, for any $\alpha \in \{0,1\}^\infty$, $\alpha$ is weakly Chaitin random if and only if $\alpha$ is Chaitin random (see [6]
for the proof and historical detail). Thus $\Omega$ is Chaitin random.

The class of computers is equal to the class of functions which are computed by self-delimiting
Turing machines. A self-delimiting Turing machine is a deterministic Turing machine which has
two tapes, a program tape and a work tape. The program tape is infinite to the right, while the
work tape is infinite in both directions. An input string in $\{0,1\}^*$ is put on the program tape.
See Chaitin [4] for the detail of self-delimiting Turing machines. Let $M$ be a self-delimiting Turing
machine which computes the optimal computer $U$. Then $P(s)$ is the probability that $M$ halts and
outputs $s$ when $M$ starts on the program tape filled with an infinite binary string generated by
infinitely repeated tosses of a fair coin. Therefore $\Omega = \sum_{s \in \{0,1\}^*} P(s)$ is the probability that $M$
just halts under the same setting.

The program-size complexity $H(s)$ is originally defined using the concept of program-size, as
stated above. However, it is possible to define $H(s)$ without referring to such a concept, i.e., we first
introduce a universal probability $m$, and then define $H(s)$ as $-\log_2 m(s)$. A universal probability
is defined through the following two definitions [20].

Definition 2.1. For any $r \colon \{0,1\}^* \to [0,1]$, we say that $r$ is a lower-computable semi-measure if
$r$ satisfies the following two conditions:

(i) $\sum_{s \in \{0,1\}^*} r(s) \le 1$.

(ii) There exists a total recursive function $f \colon \mathbb{N}^+ \times \{0,1\}^* \to \mathbb{Q}$ such that, for each $s \in \{0,1\}^*$,
$\lim_{n \to \infty} f(n,s) = r(s)$ and $\forall\, n \in \mathbb{N}^+ \ f(n,s) \le r(s)$. □

Definition 2.2. Let $m$ be a lower-computable semi-measure. We say that $m$ is a universal
probability if for any lower-computable semi-measure $r$, there exists a real number $c > 0$ such that, for
all $s \in \{0,1\}^*$, $c\,r(s) \le m(s)$. □

Chaitin [4] showed the following theorem. 

Theorem 2.3. Both $2^{-H(s)}$ and $P(s)$ are universal probabilities. □

By Theorem 2.3, we see that, for any universal probability $m$,

$$H(s) = -\log_2 m(s) + O(1). \tag{4}$$

Thus it is possible to define $H(s)$ as $-\log_2 m(s)$ with any one universal probability $m$ instead of as
$H_U(s)$. Note that the difference up to an additive constant is inessential to algorithmic information
theory.
In the works [15, 16], we generalized the notion of the randomness of an infinite binary string
so that the degree of the randomness can be characterized by a real number $D$ with $0 < D \le 1$ as
follows.

Definition 2.4 (weakly Chaitin $D$-random). Let $D \in \mathbb{R}$ with $D \ge 0$, and let $\alpha \in \{0,1\}^\infty$. We
say that $\alpha$ is weakly Chaitin $D$-random if there exists $c \in \mathbb{R}$ such that $Dn - c \le H(\alpha_n)$ for all
$n \in \mathbb{N}^+$. □

Definition 2.5 ($D$-compressible). Let $D \in \mathbb{R}$ with $D \ge 0$, and let $\alpha \in \{0,1\}^\infty$. We say that $\alpha$ is
$D$-compressible if $H(\alpha_n) \le Dn + o(n)$, which is equivalent to

$$\limsup_{n \to \infty} \frac{H(\alpha_n)}{n} \le D.$$

□

In the case of $D = 1$, the weak Chaitin $D$-randomness results in the weak Chaitin randomness.
For any $D \in [0,1]$ and any $\alpha \in \{0,1\}^\infty$, if $\alpha$ is weakly Chaitin $D$-random and $D$-compressible, then

$$\lim_{n \to \infty} \frac{H(\alpha_n)}{n} = D. \tag{5}$$

Hereafter the left-hand side of (5) is referred to as the compression rate of an infinite binary string $\alpha$
in general. Note, however, that (5) does not necessarily imply that $\alpha$ is weakly Chaitin $D$-random.

In the works [15, 16], we generalized Chaitin's halting probability $\Omega$ to $\Omega^D$ by (1) for any real
number $D > 0$. Thus, $\Omega = \Omega^1$. If $0 < D \le 1$, then $\Omega^D$ converges and $0 < \Omega^D < 1$, since
$\Omega^D \le \Omega < 1$.

Theorem 2.6 (Tadaki [15, 16]). Let $D \in \mathbb{R}$.

(i) If $0 < D \le 1$ and $D$ is computable, then $\Omega^D$ is weakly Chaitin $D$-random and $D$-compressible.
(ii) If $1 < D$, then $\Omega^D$ diverges to $\infty$. □

Definition 2.7 (Chaitin $D$-randomness, Tadaki [15, 16]). Let $D \in \mathbb{R}$ with $D \ge 0$, and let
$\alpha \in \{0,1\}^\infty$. We say that $\alpha$ is Chaitin $D$-random if $\lim_{n \to \infty} H(\alpha_n) - Dn = \infty$. □

In the case of $D = 1$, the Chaitin $D$-randomness results in the Chaitin randomness. Obviously,
for any $D \in [0,1]$ and any $\alpha \in \{0,1\}^\infty$, if $\alpha$ is Chaitin $D$-random, then $\alpha$ is weakly Chaitin
$D$-random. However, in 2005 Reimann and Stephan [13] showed that, in the case of $D < 1$, the
converse does not necessarily hold. This contrasts with the equivalence between the weak Chaitin
randomness and the Chaitin randomness, each of which corresponds to the case of $D = 1$. In the
next section, for any computable real number $D$ with $0 < D < 1$, we give an instance of a real
number which is Chaitin $D$-random and $D$-compressible.



3 Chaitin D-randomness and divergence 

For each pair of real numbers $Q > 0$ and $D > 0$, we define $W(Q,D)$ by

$$W(Q,D) = \sum_{p \in \mathrm{dom}\,U} |p|^Q\, 2^{-\frac{|p|}{D}}.$$
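
To make the shape of this series concrete, the following Python sketch evaluates its partial sums over a hypothetical prefix-free domain $\{0, 10, 110, \dots\}$ that has exactly one program of each length $l \ge 1$. This toy domain and the function names are assumptions for illustration only. Because the toy domain is so sparse, its series also converges for $D \ge 1$, so the sketch illustrates the definition and not the behavior of $\mathrm{dom}\,U$.

```python
def toy_program_lengths(m):
    """Lengths of the first m programs of the hypothetical domain {0, 10, 110, ...}."""
    return range(1, m + 1)


def w_partial(q, d, m):
    """Partial sum of |p|^Q * 2^(-|p|/D) over the first m toy programs."""
    return sum(l ** q * 2.0 ** (-l / d) for l in toy_program_lengths(m))
```

For $D = 1/2$ each term is $l^Q 4^{-l}$, so for $Q = 1$ the partial sums increase toward the closed form $r/(1-r)^2 = 4/9$ with $r = 1/4$.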

As the first result of this paper, we show the following theorem. 
Theorem 3.1. Let $Q$ and $D$ be positive real numbers.

(i) If $Q$ and $D$ are computable and $0 < D < 1$, then $W(Q,D)$ converges to a left-computable real
number which is Chaitin $D$-random and $D$-compressible.

(ii) If $1 \le D$, then $W(Q,D)$ diverges to $\infty$. □

The techniques used in the proofs of Theorem 2.6 (see [16]) and Theorem 3.1 are frequently used
throughout the rest of this paper as basic tools. We see that the weak Chaitin $D$-randomness in
Theorem 2.6 is replaced by the Chaitin $D$-randomness in Theorem 3.1 in exchange for the divergence
at $D = 1$. In order to derive this divergence we make use of Theorem 3.2 (i) below. We prove
Theorem 3.2 in a more general form, and show that the Shannon entropy $-\sum_{s \in \{0,1\}^*} m(s) \log_2 m(s)$
of an arbitrary universal probability $m$ diverges to $\infty$. We say that a function $f \colon \mathbb{N}^+ \to [0,\infty)$
is lower-computable if there exists a total recursive function $a \colon \mathbb{N}^+ \times \mathbb{N}^+ \to \mathbb{Q}$ such that, for each
$n \in \mathbb{N}^+$, $\lim_{k \to \infty} a(k,n) = f(n)$ and $\forall\, k \in \mathbb{N}^+ \ a(k,n) \le f(n)$.

Theorem 3.2. Let $A$ be an infinite r.e. subset of $\{0,1\}^*$ and let $f \colon \mathbb{N}^+ \to [0,\infty)$ be a
lower-computable function such that $\lim_{n \to \infty} f(n) = \infty$. Then the following hold.

(i) $\sum_{U(p) \in A} f(|p|)\, 2^{-|p|}$ diverges to $\infty$.

(ii) If there exists $l_0 \in \mathbb{N}^+$ such that $f(l)\, 2^{-l}$ is a nonincreasing function of $l$ for all $l \ge l_0$, then
$\sum_{s \in A} f(H(s))\, 2^{-H(s)}$ diverges to $\infty$.

Proof. (i) Contrarily, assume that $\sum_{U(p) \in A} f(|p|)\, 2^{-|p|}$ converges. Then, there exists $d \in \mathbb{N}^+$ such
that $\sum_{U(p) \in A} f(|p|)\, 2^{-|p|} \le d$. We define the function $r \colon \{0,1\}^* \to [0,\infty)$ by

$$r(s) = \frac{1}{d} \sum_{U(p)=s} f(|p|)\, 2^{-|p|}$$

if $s \in A$; $r(s) = 0$ otherwise. Then we see that $\sum_{s \in \{0,1\}^*} r(s) \le 1$ and therefore $r$ is a
lower-computable semi-measure. Since $P(s)$ is a universal probability, there exists $c \in \mathbb{N}^+$ such that
$r(s) \le c P(s)$ for all $s \in \{0,1\}^*$. Hence we have

$$\sum_{U(p)=s} \bigl(cd - f(|p|)\bigr)\, 2^{-|p|} \ge 0 \tag{6}$$

for all $s \in A$. On the other hand, since $A$ is an infinite set and $\lim_{n \to \infty} f(n) = \infty$, there is $s_0 \in A$
such that $f(|p|) > cd$ for all $p$ with $U(p) = s_0$. Therefore we have $\sum_{U(p)=s_0} \bigl(cd - f(|p|)\bigr)\, 2^{-|p|} < 0$.
However, this contradicts (6), and the proof of (i) is completed.

(ii) We first note that there is $n_0 \in \mathbb{N}$ such that $H(s) \ge l_0$ for all $s$ with $|s| \ge n_0$. Now, let
us assume contrarily that $\sum_{s \in A} f(H(s))\, 2^{-H(s)}$ converges. Then, there exists $d \in \mathbb{N}^+$ such that
$\sum_{s \in A} f(H(s))\, 2^{-H(s)} \le d$. We define the function $r \colon \{0,1\}^* \to [0,\infty)$ by

$$r(s) = \frac{1}{d}\, f(H(s))\, 2^{-H(s)}$$

if $s \in A$ and $|s| \ge n_0$; $r(s) = 0$ otherwise. Then we see that $\sum_{s \in \{0,1\}^*} r(s) \le 1$ and therefore $r$ is
a lower-computable semi-measure. Since $2^{-H(s)}$ is a universal probability by Theorem 2.3, there
exists $c \in \mathbb{N}^+$ such that $r(s) \le c\, 2^{-H(s)}$ for all $s \in \{0,1\}^*$. Hence, if $s \in A$ and $|s| \ge n_0$, then
$cd \ge f(H(s))$. On the other hand, since $A$ is an infinite set and $\lim_{n \to \infty} f(n) = \infty$, there is $s_0 \in A$
such that $|s_0| \ge n_0$ and $f(H(s_0)) > cd$. Thus, we have a contradiction, and the proof of (ii) is
completed. □

Corollary 3.3. If $m$ is a universal probability and $A$ is an infinite r.e. subset of $\{0,1\}^*$, then
$-\sum_{s \in A} m(s) \log_2 m(s)$ diverges to $\infty$.

Proof. We first note that there is a real number $x_0 > 0$ such that the function $x 2^{-x}$ of a real
number $x$ is decreasing for $x \ge x_0$. For this $x_0$, there is $n_0 \in \mathbb{N}$ such that $-\log_2 m(s) \ge x_0$ for all
$s$ with $|s| \ge n_0$. On the other hand, by (4) there is $c \in \mathbb{N}$ such that $-\log_2 m(s) \le H(s) + c$ for all
$s \in \{0,1\}^*$. Thus, we see that

$$-\sum_{s \in A \,\&\, |s| \ge n_0} m(s) \log_2 m(s) \ \ge \sum_{s \in A \,\&\, |s| \ge n_0} \bigl(H(s) + c\bigr)\, 2^{-H(s)-c}$$

$$= 2^{-c} \sum_{s \in A \,\&\, |s| \ge n_0} H(s)\, 2^{-H(s)} + c\, 2^{-c} \sum_{s \in A \,\&\, |s| \ge n_0} 2^{-H(s)}.$$

By Theorem 3.2 (ii), $\sum_{s \in A} H(s)\, 2^{-H(s)}$ diverges to $\infty$. Hence, we see, by the inequality above, that
$-\sum_{s \in A} m(s) \log_2 m(s)$ also diverges to $\infty$. □

By Corollary 3.3, we see that the Shannon entropy of an arbitrary universal probability diverges
to $\infty$.

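The proof of Theorem 3.1 is given as follows.

The proof of Theorem 3.1. Let $p_1, p_2, p_3, \dots$ be a recursive enumeration of the r.e. set $\mathrm{dom}\,U$. Then,
for every $D > 0$, $W(Q,D) = \lim_{m \to \infty} W_m(Q,D)$, where

$$W_m(Q,D) = \sum_{i=1}^{m} |p_i|^Q\, 2^{-\frac{|p_i|}{D}}.$$

(i) First we show that $W(Q,D)$ converges to a left-computable real number. Since $D < 1$, there
is $l_0 \in \mathbb{N}^+$ such that

$$\frac{1}{D} - Q\, \frac{\log_2 l}{l} \ge 1$$

for all $l \ge l_0$. Then there is $m_0 \in \mathbb{N}^+$ such that $|p_i| \ge l_0$ for all $i > m_0$. Thus, we see that, for each
$i > m_0$,

$$|p_i|^Q\, 2^{-\frac{|p_i|}{D}} = 2^{-\left(\frac{1}{D} - Q \frac{\log_2 |p_i|}{|p_i|}\right) |p_i|} \le 2^{-|p_i|}.$$

Hence, for each $m > m_0$,

$$W_m(Q,D) - W_{m_0}(Q,D) = \sum_{i=m_0+1}^{m} |p_i|^Q\, 2^{-\frac{|p_i|}{D}} \le \sum_{i=m_0+1}^{m} 2^{-|p_i|} < \Omega.$$

Thus, since $\{W_m(Q,D)\}_m$ is an increasing sequence of real numbers, it converges to a real number
$W(Q,D)$ as $m \to \infty$. Moreover, since $Q$ and $D$ are computable, $W(Q,D)$ is shown to be
left-computable.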
We then show that $W(Q,D)$ is Chaitin $D$-random. Let $\alpha$ be the infinite binary string such that
$0.\alpha$ is the base-two expansion of $W(Q,D) \bmod' 1$ with infinitely many ones. Then, since $Q$ and $D$
are computable real numbers and $\lceil W(Q,D) \rceil - 1 + 0.\alpha_n < W(Q,D)$ for all $n \in \mathbb{N}^+$, there exists a
partial recursive function $\xi \colon \{0,1\}^* \to \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$,

$$\lceil W(Q,D) \rceil - 1 + 0.\alpha_n < W_{\xi(\alpha_n)}(Q,D).$$

It is then easy to see that

$$W(Q,D) - W_{\xi(\alpha_n)}(Q,D) < 2^{-n}.$$

It follows that, for all $i > \xi(\alpha_n)$, $|p_i|^Q\, 2^{-\frac{|p_i|}{D}} < 2^{-n}$ and therefore $QD \log_2 |p_i| < |p_i| - Dn$. Thus,
given $\alpha_n$, by calculating the set $\{\, U(p_i) \mid i \le \xi(\alpha_n) \,\}$ and picking any one finite binary string which
is not in this set, one can obtain $s \in \{0,1\}^*$ such that $QD \log_2 H(s) < H(s) - Dn$.
Hence, there exists a partial recursive function $\Psi \colon \{0,1\}^* \to \{0,1\}^*$ such that

$$QD \log_2 H(\Psi(\alpha_n)) < H(\Psi(\alpha_n)) - Dn.$$

Applying this inequality to itself, we have $QD \log_2 n < H(\Psi(\alpha_n)) - Dn + O(1)$. On the other hand,
using (2) there is a natural number $c_\Psi$ such that $H(\Psi(\alpha_n)) < H(\alpha_n) + c_\Psi$. Therefore, we have

$$QD \log_2 n < H(\alpha_n) - Dn + O(1).$$

Hence, $\alpha$ is Chaitin $D$-random. It follows that $\alpha$ has infinitely many zeros, which implies that
$W(Q,D) \bmod 1 = W(Q,D) \bmod' 1 = 0.\alpha$ and therefore $(W(Q,D))_n = \alpha_n$. Thus, $W(Q,D)$ is
Chaitin $D$-random.

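Next, we show that $W(Q,D)$ is $D$-compressible. Since $Q$ and $D$ are computable real numbers,
there exists a total recursive function $g \colon \mathbb{N}^+ \times \mathbb{N}^+ \to \mathbb{Z}$ such that, for all $m, n \in \mathbb{N}^+$,

$$\bigl|\, W_m(Q,D) - \lfloor W(Q,D) \rfloor - 2^{-n} g(m,n) \,\bigr| < 2^{-n}. \tag{7}$$

Let $d$ be any computable real number with $D < d < 1$. Then, the limit value $W(Q,d)$ exists since
$d < 1$. Let $\beta$ be the infinite binary string such that $0.\beta$ is the base-two expansion of $W(Q,d) \bmod' 1$
with infinitely many ones.

Given $n$ and $\beta_{\lceil Dn/d \rceil}$ (i.e., the first $\lceil Dn/d \rceil$ bits of $\beta$), one can find $m_0 \in \mathbb{N}^+$ such that

$$\lceil W(Q,d) \rceil - 1 + 0.\beta_{\lceil Dn/d \rceil} < W_{m_0}(Q,d).$$

This is possible since $\lceil W(Q,d) \rceil - 1 + 0.\beta_{\lceil Dn/d \rceil} < W(Q,d)$ and $\lim_{m \to \infty} W_m(Q,d) = W(Q,d)$. It
is then easy to see that

$$\sum_{i=m_0+1}^{\infty} |p_i|^Q\, 2^{-\frac{|p_i|}{d}} < 2^{-Dn/d}.$$

Raising both sides of this inequality to the power $d/D$ and using the inequality $a^c + b^c \le (a+b)^c$
for real numbers $a, b > 0$ and $c \ge 1$,

$$\sum_{i=m_0+1}^{\infty} |p_i|^Q\, 2^{-\frac{|p_i|}{D}} \le \sum_{i=m_0+1}^{\infty} |p_i|^{Qd/D}\, 2^{-\frac{|p_i|}{D}} < 2^{-n}.$$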
It follows that

$$W(Q,D) - W_{m_0}(Q,D) < 2^{-n}. \tag{8}$$

From (7), (8), and

$$\bigl|\, \lfloor W(Q,D) \rfloor + 0.(W(Q,D))_n - W(Q,D) \,\bigr| < 2^{-n},$$

it is shown that

$$\bigl|\, (W(Q,D))_n - g(m_0,n) \,\bigr| < 3$$

and therefore

$$(W(Q,D))_n = g(m_0,n),\ g(m_0,n) \pm 1,\ g(m_0,n) \pm 2,$$

where $(W(Q,D))_n$ is regarded as a dyadic integer. Thus, there are still 5 possibilities of $(W(Q,D))_n$,
so that one needs only 3 bits more in order to determine $(W(Q,D))_n$.

Thus, there exists a partial recursive function $\Phi \colon \mathbb{N}^+ \times \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall\, n \in \mathbb{N}^+ \ \ \exists\, s \in \{0,1\}^* \quad |s| = 3 \ \ \&\ \ \Phi(n, \beta_{\lceil Dn/d \rceil}, s) = (W(Q,D))_n.$$

It follows from (3) that $H((W(Q,D))_n) \le |\beta_{\lceil Dn/d \rceil}| + o(n) \le Dn/d + o(n)$, which implies that
$W(Q,D)$ is $D/d$-compressible. Since $d$ is any computable real number with $D < d < 1$, it follows
that $W(Q,D)$ is $D$-compressible.

(ii) We choose any one computable real number $Q'$ with $Q \ge Q' > 0$. Then, using Theorem
3.2 (i), we can show that $W(Q',1) = \sum_{p \in \mathrm{dom}\,U} |p|^{Q'}\, 2^{-|p|}$ diverges to $\infty$. Thus, since
$W_m(Q,D) \ge W_m(Q',1)$, we see that $W(Q,D)$ also diverges to $\infty$, and the proof is completed. □



4 Temperature as a compression rate 

In this section we introduce the notion of thermodynamic quantities such as partition function,
free energy, energy, entropy, and specific heat, into algorithmic information theory by performing
Replacements 1.1 for the corresponding thermodynamic quantities in statistical mechanics.^2 We
investigate their convergence and the degree of randomness. For that purpose, we first choose
any one enumeration $q_1, q_2, q_3, \dots$ of the countably infinite set $\mathrm{dom}\,U$ as the standard one for use
throughout this section.^3

In statistical mechanics, the partition function $Z_{\mathrm{sm}}(T)$ at temperature $T$ is given by

$$Z_{\mathrm{sm}}(T) = \sum_{x \in X} e^{-\frac{E_x}{kT}}. \tag{9}$$

Motivated by the formula (9) and taking into account Replacements 1.1, we introduce the notion
of partition function into algorithmic information theory as follows.

Definition 4.1 (partition function). For each $n \in \mathbb{N}^+$ and each real number $T > 0$, we define
$Z_n(T)$ by

$$Z_n(T) = \sum_{i=1}^{n} 2^{-\frac{|q_i|}{T}}.$$

Then, for each $T > 0$, the partition function $Z(T)$ is defined by $Z(T) = \lim_{n \to \infty} Z_n(T)$. □

^2 For the thermodynamic quantities in statistical mechanics, see Chapter 16 of [1] and Chapter 2 of [18]. To be
precise, the partition function is not a thermodynamic quantity but a statistical mechanical quantity.

^3 The enumeration $\{q_i\}$ is quite arbitrary and therefore we do not require $\{q_i\}$ to be a recursive enumeration
of $\mathrm{dom}\,U$.

Since $Z(T) = \Omega^T$, we restate Theorem 2.6 in the following form.

Theorem 4.2 (Tadaki [15, 16]). Let $T \in \mathbb{R}$.

(i) If $0 < T \le 1$ and $T$ is computable, then $Z(T)$ converges to a left-computable real number
which is weakly Chaitin $T$-random and $T$-compressible.

(ii) If $1 < T$, then $Z(T)$ diverges to $\infty$. □

In statistical mechanics, the free energy $F_{\mathrm{sm}}(T)$ at temperature $T$ is given by

$$F_{\mathrm{sm}}(T) = -kT \ln Z_{\mathrm{sm}}(T), \tag{10}$$

where $Z_{\mathrm{sm}}(T)$ is given by (9). Motivated by the formula (10) and taking into account
Replacements 1.1, we introduce the notion of free energy into algorithmic information theory as follows.

Definition 4.3 (free energy). For each $n \in \mathbb{N}^+$ and each real number $T > 0$, we define $F_n(T)$ by

$$F_n(T) = -T \log_2 Z_n(T).$$

Then, for each $T > 0$, the free energy $F(T)$ is defined by $F(T) = \lim_{n \to \infty} F_n(T)$. □
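
Definitions 4.1 and 4.3 can be evaluated directly on a finite list of programs. In the Python sketch below, `TOY_ENUMERATION` is a hypothetical three-element stand-in for the enumeration $q_1, q_2, q_3, \dots$, and the function names are assumptions; the sketch only illustrates the finite quantities $Z_n(T)$ and $F_n(T)$, not their limits.

```python
import math

# Hypothetical stand-in for the first few programs q_1, q_2, q_3 of dom U.
TOY_ENUMERATION = ["0", "10", "110"]


def partition_function_n(programs, n, temperature):
    """Z_n(T): sum of 2^(-|q_i|/T) over the first n programs."""
    return sum(2.0 ** (-len(q) / temperature) for q in programs[:n])


def free_energy_n(programs, n, temperature):
    """F_n(T) = -T * log2(Z_n(T))."""
    return -temperature * math.log2(partition_function_n(programs, n, temperature))
```

With a single program $q_1$, $F_1(T) = |q_1|$ for every $T$, since $Z_1(T) = 2^{-|q_1|/T}$. Adding further programs increases $Z_n(T)$ and therefore decreases $F_n(T)$.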

Theorem 4.4. Let $T \in \mathbb{R}$.

(i) If $0 < T \le 1$ and $T$ is computable, then $F(T)$ converges to a right-computable real number
which is weakly Chaitin $T$-random and $T$-compressible.

(ii) If $1 < T$, then $F(T)$ diverges to $-\infty$.

Proof. (i) Since $Z(T)$ converges by Theorem 4.2 (i) and $Z(T) > 0$, $F(T)$ also converges and

$$F(T) = -T \log_2 Z(T).$$

Note that $T$ is a right-computable real number and $-\log_2 Z(T) > 0$. Since $Z(T)$ is a left-computable
real number by Theorem 4.2 (i), $F(T)$ is a right-computable real number.

We show that $F(T)$ is weakly Chaitin $T$-random. By the mean value theorem, there exists
$c \in \mathbb{N}^+$ such that, for any $A, B \in \mathbb{R}$, if $A \ge F(T)$ and $B \ge 1/T$ then

$$0 \le Z(T) - 2^{-AB} \le 2^{c} \max\{A - F(T),\ B - 1/T\}. \tag{11}$$

Since $F(T)$ is a right-computable real number, there exists a total recursive function $f \colon \mathbb{N}^+ \to \mathbb{Q}$
such that $F(T) \le f(m)$ for all $m \in \mathbb{N}^+$ and $\lim_{m \to \infty} f(m) = F(T)$. Since $T$ is a computable real
number, there exists a total recursive function $g \colon \mathbb{N}^+ \to \mathbb{Q}$ such that $0 \le g(n) - 1/T < 2^{-n}$ for all
$n \in \mathbb{N}^+$.

Given $(F(T))_n$, one can find $m_0 \in \mathbb{N}^+$ such that

$$f(m_0) < \lfloor F(T) \rfloor + 0.(F(T))_n + 2^{-n}.$$
This is possible because $F(T) < \lfloor F(T) \rfloor + 0.(F(T))_n + 2^{-n}$. It follows that $0 \le f(m_0) - F(T) < 2^{-n}$.
Therefore, by (11) it is shown that $0 \le Z(T) - 2^{-f(m_0) g(n)} < 2^{c-n}$. Let $l_n$ be the first $n$ bits of the
base-two expansion of $2^{-f(m_0) g(n)}$ with infinitely many zeros. It follows that

$$0 \le 0.(Z(T))_n - 0.l_n < Z(T) - 2^{-f(m_0) g(n)} + 2^{-n} < (2^{c} + 1)\, 2^{-n}.$$

Hence

$$(Z(T))_n = l_n,\ l_n + 1,\ l_n + 2,\ \dots,\ l_n + (2^{c} + 1),$$

where $(Z(T))_n$ and $l_n$ are regarded as dyadic integers. Therefore, there are still $2^{c} + 2$ possibilities
of $(Z(T))_n$, so that one needs only $c + 1$ bits more in order to determine $(Z(T))_n$.

Thus, there exists a partial recursive function $\Phi \colon \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall\, n \in \mathbb{N}^+ \ \ \exists\, s \in \{0,1\}^* \quad |s| = c + 1 \ \ \&\ \ \Phi((F(T))_n, s) = (Z(T))_n.$$

It follows that there exists $c_\Phi \in \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$,

$$H((Z(T))_n) \le H((F(T))_n) + c_\Phi.$$

Hence, $F(T)$ is weakly Chaitin $T$-random by Theorem 4.2 (i).

Next, we show that $F(T)$ is $T$-compressible. Since $T$ is a computable real number, there exists
a total recursive function $a \colon \{0,1\}^* \times \mathbb{N}^+ \to \mathbb{Z}$ such that, for all $s \in \{0,1\}^*$ and all $n \in \mathbb{N}^+$, if
$0.s > 0$ then

$$\bigl|\, -T \log_2 0.s - \lfloor F(T) \rfloor - 2^{-n} a(s,n) \,\bigr| < 2^{-n}. \tag{12}$$

By the mean value theorem, it is also shown that there is $d \in \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$, if
$0.(Z(T))_n > 0$ then

$$\bigl|\, -T \log_2 0.(Z(T))_n - F(T) \,\bigr| < 2^{d-n}. \tag{13}$$

From (12), (13), and

$$\bigl|\, \lfloor F(T) \rfloor + 0.(F(T))_n - F(T) \,\bigr| < 2^{-n},$$

it is shown that, for all $n \in \mathbb{N}^+$, if $0.(Z(T))_n > 0$ then

$$\bigl|\, (F(T))_n - a((Z(T))_n, n) \,\bigr| < 2^{d} + 2$$

and therefore

$$(F(T))_n = a((Z(T))_n, n),\ a((Z(T))_n, n) \pm 1,\ a((Z(T))_n, n) \pm 2,\ \dots,\ a((Z(T))_n, n) \pm (2^{d} + 1),$$

where $(F(T))_n$ is regarded as a dyadic integer. Therefore, there are still $2^{d+1} + 3$ possibilities of
$(F(T))_n$, so that one needs only $d + 2$ bits more in order to determine $(F(T))_n$.

Thus, there exists a partial recursive function $\Psi \colon \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall\, n \in \mathbb{N}^+ \ \ \exists\, s \in \{0,1\}^* \quad |s| = d + 2 \ \ \&\ \ \Psi((Z(T))_n, s) = (F(T))_n.$$

It follows that there exists $c_\Psi \in \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$,

$$H((F(T))_n) \le H((Z(T))_n) + c_\Psi.$$

Since $Z(T)$ is $T$-compressible by Theorem 4.2 (i), $F(T)$ is also $T$-compressible.

(ii) In the case of $T > 1$, since $\lim_{n \to \infty} Z_n(T) = \infty$ by Theorem 4.2 (ii), we see that $F_n(T)$
diverges to $-\infty$ as $n \to \infty$. □
In statistical mechanics, the energy $E_{\mathrm{sm}}(T)$ at temperature $T$ is given by

$$E_{\mathrm{sm}}(T) = \frac{1}{Z_{\mathrm{sm}}(T)} \sum_{x \in X} E_x e^{-\frac{E_x}{kT}}, \tag{14}$$

where $Z_{\mathrm{sm}}(T)$ is given by (9). Motivated by the formula (14) and taking into account Replacements 4.1,
we introduce the notion of energy into algorithmic information theory as follows.

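Definition 4.5 (energy). For each $n \in \mathbb{N}^+$ and each real number $T > 0$, we define $E_n(T)$ by

$$E_n(T) = \frac{1}{Z_n(T)} \sum_{i=1}^{n} |q_i|\, 2^{-\frac{|q_i|}{T}}.$$

Then, for each $T > 0$, the energy $E(T)$ is defined by $E(T) = \lim_{n\to\infty} E_n(T)$. □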
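As a purely numerical illustration of the partial sums $Z_n(T)$ and $E_n(T)$, the sketch below evaluates them on a small list of codeword lengths. The finite prefix-free set $\{0, 10, 110, 111\}$ is a hypothetical stand-in; the actual set $\operatorname{dom} U$ is infinite and only recursively enumerable, so nothing here computes $E(T)$ itself. The function names are likewise assumptions made for the example.

```python
import math


def partition_sum(lengths, T):
    """Z_n(T): sum of 2^(-|q_i|/T) over the given codeword lengths."""
    return sum(2.0 ** (-l / T) for l in lengths)


def energy(lengths, T):
    """E_n(T): average codeword length under the weights 2^(-|q_i|/T)."""
    weights = [2.0 ** (-l / T) for l in lengths]
    return sum(l * w for l, w in zip(lengths, weights)) / sum(weights)


# Hypothetical toy prefix-free set {0, 10, 110, 111}: lengths 1, 2, 3, 3.
toy_lengths = [1, 2, 3, 3]
Z_at_1 = partition_sum(toy_lengths, 1.0)
E_at_1 = energy(toy_lengths, 1.0)
E_at_half = energy(toy_lengths, 0.5)
```

On this toy set the weights at $T = 1$ are the usual $2^{-|q_i|}$, and lowering $T$ shifts weight toward short codewords, so the energy decreases with the temperature.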

Theorem 4.6. Let $T \in \mathbb{R}$.

(i) If $0 < T < 1$ and $T$ is computable, then $E(T)$ converges to a left-computable real number
which is Chaitin $T$-random and $T$-compressible.

(ii) If $1 \le T$, then $E(T)$ diverges to $\infty$.

Proof. (i) First we show that $E(T)$ converges. By Theorem 4.2 (i), the denominator $Z_n(T)$ of
$E_n(T)$ converges to the real number $Z(T) > 0$ as $n \to \infty$. On the other hand, by Theorem 3.1 (i),
the numerator $\sum_{i=1}^{n} |q_i|\, 2^{-|q_i|/T}$ of $E_n(T)$ converges to the real number $W(1,T)$ as $n \to \infty$. Thus,
$E_n(T)$ converges to the real number $W(1,T)/Z(T)$ as $n \to \infty$.

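Next, we show that $E(T)$ is a left-computable real number. Let $p_1, p_2, p_3, \dots$ be a recursive
enumeration of the r.e. set $\operatorname{dom} U$. For each $m \in \mathbb{N}^+$, we define $\tilde{W}_m(T)$ and $\tilde{Z}_m(T)$ by

$$\tilde{W}_m(T) = \sum_{i=1}^{m} |p_i|\, 2^{-\frac{|p_i|}{T}} \quad\text{and}\quad \tilde{Z}_m(T) = \sum_{i=1}^{m} 2^{-\frac{|p_i|}{T}},$$

and then define $\tilde{E}_m(T)$ by $\tilde{E}_m(T) = \tilde{W}_m(T)/\tilde{Z}_m(T)$. Since the numerator and the denominator of
$E_n(T)$ are positive term series which converge as $n \to \infty$, we see that $W(1,T) = \lim_{m\to\infty} \tilde{W}_m(T)$,
$Z(T) = \lim_{m\to\infty} \tilde{Z}_m(T)$, and $E(T) = \lim_{m\to\infty} \tilde{E}_m(T)$. We then see that

$$\tilde{E}_{m+1}(T) - \tilde{E}_m(T) = \frac{\left( |p_{m+1}|\, \tilde{Z}_m(T) - \tilde{W}_m(T) \right) 2^{-\frac{|p_{m+1}|}{T}}}{\tilde{Z}_{m+1}(T)\, \tilde{Z}_m(T)}.$$

Since $\tilde{W}_m(T)$ and $\tilde{Z}_m(T)$ converge as $m \to \infty$ and $\lim_{m\to\infty} |p_m| = \infty$, there exist $a \in \mathbb{N}^+$ and
$m_1 \in \mathbb{N}^+$ such that, for any $m \ge m_1$,

$$\tilde{E}_{m+1}(T) - \tilde{E}_m(T) > 2^{-a}\, |p_{m+1}|\, 2^{-\frac{|p_{m+1}|}{T}}. \tag{15}$$

In particular, $\tilde{E}_m(T)$ is an increasing function of $m$ for all $m \ge m_1$ by the above inequality. Thus,
since $T$ is a computable real number, $E(T)$ is shown to be a left-computable real number.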

We then show that $E(T)$ is Chaitin $T$-random. Let $\alpha$ be the infinite binary string such that $0.\alpha$
is the base-two expansion of $E(T) \bmod' 1$ with infinitely many ones. Then, since $T$ is a computable
real number and $\lceil E(T) \rceil - 1 + 0.\alpha_n < E(T)$ for all $n \in \mathbb{N}^+$, there exists a partial recursive function
$\xi\colon \{0,1\}^* \to \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$, $\xi(\alpha_n) \ge m_1$ and

$$\lceil E(T) \rceil - 1 + 0.\alpha_n < \tilde{E}_{\xi(\alpha_n)}(T).$$

It is then easy to see that $E(T) - \tilde{E}_{\xi(\alpha_n)}(T) < 2^{-n}$. It follows from (15) that, for all $i > \xi(\alpha_n)$,
$2^{-a} |p_i|\, 2^{-|p_i|/T} < 2^{-n}$ and therefore $T \log_2 |p_i| - Ta < |p_i| - Tn$. Thus, given $\alpha_n$, by calculating the
set $\{\, U(p_i) \mid i \le \xi(\alpha_n) \,\}$ and picking any one finite binary string which is not in this set, one can
obtain $s \in \{0,1\}^*$ such that $T \log_2 H(s) - Ta < H(s) - Tn$.

Hence, there exists a partial recursive function $\Psi\colon \{0,1\}^* \to \{0,1\}^*$ such that

$$T \log_2 H(\Psi(\alpha_n)) - Ta < H(\Psi(\alpha_n)) - Tn.$$

Applying this inequality to itself, we have $T \log_2 n < H(\Psi(\alpha_n)) - Tn + O(1)$. On the other hand,
using (2) there is a natural number $c_\Psi$ such that $H(\Psi(\alpha_n)) < H(\alpha_n) + c_\Psi$. Therefore, we have

$$T \log_2 n < H(\alpha_n) - Tn + O(1).$$

Hence, $\alpha$ is Chaitin $T$-random. It follows that $\alpha$ has infinitely many zeros, which implies that
$E(T) \bmod 1 = E(T) \bmod' 1 = 0.\alpha$ and therefore $(E(T))_n = \alpha_n$. Thus, $E(T)$ is Chaitin $T$-random.

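Next, we show that $E(T)$ is $T$-compressible. Since $T$ is a computable real number, there exists
a total recursive function $g\colon \mathbb{N}^+ \times \mathbb{N}^+ \to \mathbb{Z}$ such that, for all $m, n \in \mathbb{N}^+$,

$$\left| \tilde{E}_m(T) - \lfloor E(T) \rfloor - 2^{-n} g(m,n) \right| < 2^{-n}. \tag{16}$$

It is also shown that there is $c \in \mathbb{N}^+$ such that, for all $m \in \mathbb{N}^+$,

$$\left| E(T) - \tilde{E}_m(T) \right| < 2^{c} \max\left\{ \left| W(1,T) - \tilde{W}_m(T) \right|,\ \left| Z(T) - \tilde{Z}_m(T) \right| \right\}. \tag{17}$$

Let $t$ be any computable real number with $T < t < 1$. Then, $W(1,t) = \lim_{m\to\infty} \tilde{W}_m(t)$, where

$$\tilde{W}_m(t) = \sum_{i=1}^{m} |p_i|\, 2^{-\frac{|p_i|}{t}}.$$

The limit value $W(1,t)$ exists since $t < 1$. Let $\beta$ be the infinite binary string such that $0.\beta$ is the
base-two expansion of $W(1,t) \bmod' 1$ with infinitely many ones.

Given $n$ and $\beta_{\lceil Tn/t \rceil}$ (i.e., the first $\lceil Tn/t \rceil$ bits of $\beta$), one can find $m_0 \in \mathbb{N}^+$ such that

$$\lceil W(1,t) \rceil - 1 + 0.\beta_{\lceil Tn/t \rceil} < \tilde{W}_{m_0}(t).$$

This is possible since $t$ is a computable real number and $\lceil W(1,t) \rceil - 1 + 0.\beta_{\lceil Tn/t \rceil} < W(1,t)$. It is
then easy to see that

$$\sum_{i=m_0+1}^{\infty} |p_i|\, 2^{-\frac{|p_i|}{t}} < 2^{-Tn/t}.$$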
Raising both sides of this inequality to the power $t/T$ and using the inequality $a^c + b^c \le (a + b)^c$
for real numbers $a, b > 0$ and $c \ge 1$, we have

$$2^{-n} > \sum_{i=m_0+1}^{\infty} |p_i|^{\frac{t}{T}}\, 2^{-\frac{|p_i|}{T}} \ge \sum_{i=m_0+1}^{\infty} |p_i|\, 2^{-\frac{|p_i|}{T}}$$

and therefore

$$2^{-n} > \sum_{i=m_0+1}^{\infty} 2^{-\frac{|p_i|}{T}}.$$

It follows that

$$\left| Z(T) - \tilde{Z}_{m_0}(T) \right| < 2^{-n} \quad\text{and}\quad \left| W(1,T) - \tilde{W}_{m_0}(T) \right| < 2^{-n}. \tag{18}$$

From (16), (17), (18), and

$$\left| \lfloor E(T) \rfloor + 0.(E(T))_n - E(T) \right| < 2^{-n},$$

it is shown that

$$\left| (E(T))_n - g(m_0, n) \right| < 2^{c} + 2$$

and therefore

$$(E(T))_n = g(m_0,n),\ g(m_0,n) \pm 1,\ g(m_0,n) \pm 2,\ \dots,\ g(m_0,n) \pm (2^{c} + 1),$$

where $(E(T))_n$ is regarded as a dyadic integer. Thus, there are still $2^{c+1} + 3$ possibilities of $(E(T))_n$,
so that one needs only $c + 2$ bits more in order to determine $(E(T))_n$.

Thus, there exists a partial recursive function $\Phi\colon \mathbb{N}^+ \times \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall n \in \mathbb{N}^+ \ \exists s \in \{0,1\}^* \quad |s| = c + 2 \ \&\ \Phi(n, \beta_{\lceil Tn/t \rceil}, s) = (E(T))_n.$$

It follows from (3) that $H((E(T))_n) \le |\beta_{\lceil Tn/t \rceil}| + o(n) \le Tn/t + o(n)$, which implies that $E(T)$ is
$T/t$-compressible. Since $t$ is any computable real number with $T < t < 1$, it follows that $E(T)$ is
$T$-compressible.

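(ii) In the case of $T = 1$, by Theorem 3.1 (ii), the numerator $W(1,1) = \sum_{p \in \operatorname{dom} U} |p|\, 2^{-|p|}$ of
$E(1)$ diverges to $\infty$. On the other hand, the denominator $Z(1)$ of $E(1)$ converges. Thus, $E(1)$
diverges to $\infty$.

The case of $T > 1$ is treated as follows. We note that $\lim_{n\to\infty} |q_n| = \infty$. Given $M > 0$, there is
$n_0 \in \mathbb{N}^+$ such that $|q_i| \ge 2M$ for all $i > n_0$. Since $\lim_{n\to\infty} Z_n(T) = \infty$ by Theorem 4.2 (ii), there
is $n_1 \in \mathbb{N}^+$ such that

$$\frac{1}{Z_n(T)} \sum_{i=1}^{n_0} 2^{-\frac{|q_i|}{T}} \le \frac{1}{2}$$

for all $n \ge n_1$. Thus, for every $n \ge \max\{n_0, n_1\}$,

$$E_n(T) \ge \frac{1}{Z_n(T)} \sum_{i=n_0+1}^{n} |q_i|\, 2^{-\frac{|q_i|}{T}} \ge 2M\, \frac{1}{Z_n(T)} \sum_{i=n_0+1}^{n} 2^{-\frac{|q_i|}{T}} = 2M \left( 1 - \frac{1}{Z_n(T)} \sum_{i=1}^{n_0} 2^{-\frac{|q_i|}{T}} \right) \ge 2M \cdot \frac{1}{2} = M.$$

Hence, $\lim_{n\to\infty} E_n(T) = \infty$, and the proof is completed. □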
In statistical mechanics, the entropy $S_{\mathrm{sm}}(T)$ at temperature $T$ is given by

$$S_{\mathrm{sm}}(T) = \frac{1}{T} E_{\mathrm{sm}}(T) + k \ln Z_{\mathrm{sm}}(T), \tag{19}$$

where $Z_{\mathrm{sm}}(T)$ and $E_{\mathrm{sm}}(T)$ are given by (9) and (14), respectively. Motivated by the formula (19)
and taking into account Replacements 4.1, we introduce the notion of statistical mechanical entropy
into algorithmic information theory as follows.

Definition 4.7 (statistical mechanical entropy). For each $n \in \mathbb{N}^+$ and each real number $T > 0$,
we define $S_n(T)$ by

$$S_n(T) = \frac{1}{T} E_n(T) + \log_2 Z_n(T).$$

Then, for each $T > 0$, the statistical mechanical entropy $S(T)$ is defined by $S(T) = \lim_{n\to\infty} S_n(T)$. □
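As a side remark that is not part of the original argument, the quantity $S_n(T)$ can be read as the Shannon entropy (in bits) of the normalized weights attached to the first $n$ programs. The short derivation below uses only the definitions of $Z_n(T)$ and $E_n(T)$; the symbol $\pi_i$ is introduced here for illustration.

```latex
% Normalized weights (notation introduced for this remark only):
%   \pi_i = 2^{-|q_i|/T} / Z_n(T), \quad i = 1, \dots, n.
\begin{align*}
-\sum_{i=1}^{n} \pi_i \log_2 \pi_i
  &= -\sum_{i=1}^{n} \pi_i \left( -\frac{|q_i|}{T} - \log_2 Z_n(T) \right) \\
  &= \frac{1}{T} \sum_{i=1}^{n} |q_i|\, \pi_i + \log_2 Z_n(T) \sum_{i=1}^{n} \pi_i \\
  &= \frac{1}{T} E_n(T) + \log_2 Z_n(T).
\end{align*}
```

The last line is exactly the right-hand side of the definition of $S_n(T)$, since the weights $\pi_i$ sum to one.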

Theorem 4.8. Let $T \in \mathbb{R}$.

(i) If $0 < T < 1$ and $T$ is computable, then $S(T)$ converges to a left-computable real number
which is Chaitin $T$-random and $T$-compressible.

(ii) If $1 \le T$, then $S(T)$ diverges to $\infty$.

Proof. (i) Since $Z(T)$ and $E(T)$ converge by Theorem 4.2 (i) and Theorem 4.6 (i), respectively,
$S(T)$ also converges and

$$S(T) = \frac{1}{T} E(T) + \log_2 Z(T).$$

Since $E(T)$ is a left-computable real number and $E(T) \notin \mathbb{N}$ by Theorem 4.6 (i), there exists
a total recursive function $f\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $0 < f(m) < E(T)$ and $\lfloor f(m) \rfloor = \lfloor E(T) \rfloor$ for all
$m \in \mathbb{N}^+$ and $\lim_{m\to\infty} f(m) = E(T)$. Since $T$ is a computable real number, there exists a total
recursive function $g\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $0 < g(m) < 1/T$ for all $m \in \mathbb{N}^+$ and $\lim_{m\to\infty} g(m) = 1/T$.
Since $Z(T)$ is a left-computable real number by Theorem 4.2 (i), there exists a total recursive
function $h\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $h(m) < \log_2 Z(T)$ for all $m \in \mathbb{N}^+$ and $\lim_{m\to\infty} h(m) = \log_2 Z(T)$.






Hence, $g(m)f(m) + h(m) < S(T)$ for all $m \in \mathbb{N}^+$ and $\lim_{m\to\infty} \left( g(m)f(m) + h(m) \right) = S(T)$. Thus,
$S(T)$ is a left-computable real number.

We then show that $S(T)$ is Chaitin $T$-random. Let $\alpha$ be the infinite binary string such
that $0.\alpha$ is the base-two expansion of $S(T) \bmod' 1$ with infinitely many ones.

Given $\alpha_n$, one can find $m_0 \in \mathbb{N}^+$ such that

$$\lceil S(T) \rceil - 1 + 0.\alpha_n < g(m_0) f(m_0) + h(m_0).$$

This is possible because $\lceil S(T) \rceil - 1 + 0.\alpha_n < S(T)$ and $\lim_{m\to\infty} \left( g(m)f(m) + h(m) \right) = S(T)$. It is
shown that

$$\begin{aligned}
2^{-n} &> \left( \frac{1}{T} E(T) + \log_2 Z(T) \right) - \left( g(m_0) f(m_0) + h(m_0) \right) \\
&> \frac{1}{T} E(T) - g(m_0) f(m_0) \\
&> \frac{1}{T} \left( E(T) - f(m_0) \right) \\
&> E(T) - f(m_0).
\end{aligned}$$

Thus, $0 < E(T) - f(m_0) < 2^{-n}$. Let $l_n$ be the first $n$ bits of the base-two expansion of the rational
number $f(m_0) - \lfloor f(m_0) \rfloor$ with infinitely many zeros. It follows that

$$0 \le 0.(E(T))_n - 0.l_n < E(T) - f(m_0) + 2^{-n} < 2 \cdot 2^{-n}.$$

Hence

$$(E(T))_n = l_n,\ l_n + 1,$$

where $(E(T))_n$ and $l_n$ are regarded as dyadic integers. Thus, there are still 2 possibilities of
$(E(T))_n$, so that one needs only 1 bit more in order to determine $(E(T))_n$.

Thus, there exists a partial recursive function $\Phi\colon \{0,1\}^* \times \{0,1\} \to \{0,1\}^*$ such that

$$\forall n \in \mathbb{N}^+ \ \exists b \in \{0,1\} \quad \Phi(\alpha_n, b) = (E(T))_n.$$

It follows that there exists $c_\Phi \in \mathbb{N}^+$ such that, for all $n \in \mathbb{N}^+$,

$$H((E(T))_n) \le H(\alpha_n) + c_\Phi.$$

Hence, $\alpha$ is Chaitin $T$-random by Theorem 4.6 (i). It follows that $\alpha$ has infinitely many zeros,
which implies that $S(T) \bmod 1 = S(T) \bmod' 1 = 0.\alpha$ and therefore $(S(T))_n = \alpha_n$. Thus, $S(T)$ is
also Chaitin $T$-random.

Next, we show that $S(T)$ is $T$-compressible. Let $p_1, p_2, p_3, \dots$ be a recursive enumeration of the
r.e. set $\operatorname{dom} U$. For each $m \in \mathbb{N}^+$, we define $\tilde{W}_m(T)$ and $\tilde{Z}_m(T)$ by

$$\tilde{W}_m(T) = \sum_{i=1}^{m} |p_i|\, 2^{-\frac{|p_i|}{T}} \quad\text{and}\quad \tilde{Z}_m(T) = \sum_{i=1}^{m} 2^{-\frac{|p_i|}{T}},$$

and then define $\tilde{S}_m(T)$ by

$$\tilde{S}_m(T) = \frac{1}{T} \frac{\tilde{W}_m(T)}{\tilde{Z}_m(T)} + \log_2 \tilde{Z}_m(T).$$

Since, in the definition of $S_n(T)$, $Z_n(T)$ and the numerator and the denominator of $E_n(T)$ are
positive term series which converge as $n \to \infty$, we see that $W(1,T) = \lim_{m\to\infty} \tilde{W}_m(T)$, $Z(T) =
\lim_{m\to\infty} \tilde{Z}_m(T)$, and $S(T) = \lim_{m\to\infty} \tilde{S}_m(T)$.

Since $T$ is a computable real number, there exists a total recursive function $a\colon \mathbb{N}^+ \times \mathbb{N}^+ \to \mathbb{Z}$
such that, for all $m, n \in \mathbb{N}^+$,

$$\left| \tilde{S}_m(T) - \lfloor S(T) \rfloor - 2^{-n} a(m,n) \right| < 2^{-n}. \tag{20}$$

It is also shown that there is $d \in \mathbb{N}^+$ such that, for all $m \in \mathbb{N}^+$,

$$\left| S(T) - \tilde{S}_m(T) \right| < 2^{d} \max\left\{ \left| W(1,T) - \tilde{W}_m(T) \right|,\ \left| Z(T) - \tilde{Z}_m(T) \right| \right\}. \tag{21}$$

Based on the inequalities (20) and (21), in the same manner as the proof of the $T$-compressibility
of $E(T)$ in Theorem 4.6 (i), we can show that $S(T)$ is $T$-compressible.

(ii) In the case of $T \ge 1$, since $\lim_{n\to\infty} E_n(T) = \infty$ by Theorem 4.6 (ii) and $\log_2 Z_n(T)$ is
bounded from below by Theorem 4.2, we see that $\lim_{n\to\infty} S_n(T) = \infty$. □



Finally, in statistical mechanics, the specific heat $C_{\mathrm{sm}}(T)$ at temperature $T$ is given by

$$C_{\mathrm{sm}}(T) = \frac{d}{dT} E_{\mathrm{sm}}(T), \tag{22}$$

where $E_{\mathrm{sm}}(T)$ is given by (14). Motivated by the formula (22), we introduce the notion of specific
heat into algorithmic information theory as follows.

Definition 4.9 (specific heat). For each $n \in \mathbb{N}^+$ and each real number $T > 0$, we define $C_n(T)$ by

$$C_n(T) = E_n'(T),$$

where $E_n'(T)$ is the derived function of $E_n(T)$. Then, for each $T > 0$, the specific heat $C(T)$ is
defined by $C(T) = \lim_{n\to\infty} C_n(T)$. □

Theorem 4.10. Let $T \in \mathbb{R}$.

(i) If $0 < T < 1$ and $T$ is computable, then $C(T)$ converges to a left-computable real number
which is Chaitin $T$-random and $T$-compressible, and moreover $C(T) = E'(T)$ where $E'(T)$ is
the derived function of $E(T)$.

(ii) If $T = 1$, then $C(T)$ diverges to $\infty$.

Proof. (i) First we show that $C(T)$ converges. Note that

$$C_n(T) = \frac{\ln 2}{T^2} \left\{ \frac{Y_n(T)}{Z_n(T)} - \left( \frac{W_n(T)}{Z_n(T)} \right)^{2} \right\},$$

where

$$Y_n(T) = \sum_{i=1}^{n} |q_i|^2\, 2^{-\frac{|q_i|}{T}} \quad\text{and}\quad W_n(T) = \sum_{i=1}^{n} |q_i|\, 2^{-\frac{|q_i|}{T}}.$$

By Theorem 4.2 (i), $Z_n(T)$ converges to the real number $Z(T) > 0$ as $n \to \infty$. On the other
hand, by Theorem 3.1 (i), $Y_n(T)$ and $W_n(T)$ converge to the real numbers $W(2,T)$ and $W(1,T)$,
respectively, as $n \to \infty$. Thus, $C_n(T)$ also converges to a real number $C(T)$ as $n \to \infty$.
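The closed form for $C_n(T)$ noted at the start of this proof can be sanity-checked numerically against a finite-difference derivative of $E_n(T)$. The sketch below does this on a hypothetical finite prefix-free set with codeword lengths 1, 2, 3, 3; the helper names and the step size are assumptions of the example, and the check says nothing about the limit $n \to \infty$.

```python
import math


def energy(lengths, T):
    """E_n(T) = W_n(T) / Z_n(T) for the given codeword lengths."""
    w = [2.0 ** (-l / T) for l in lengths]
    return sum(l * x for l, x in zip(lengths, w)) / sum(w)


def specific_heat(lengths, T):
    """Closed form (ln 2 / T^2) * (Y_n/Z_n - (W_n/Z_n)^2)."""
    w = [2.0 ** (-l / T) for l in lengths]
    Z = sum(w)
    W = sum(l * x for l, x in zip(lengths, w))
    Y = sum(l * l * x for l, x in zip(lengths, w))
    return math.log(2) / T**2 * (Y / Z - (W / Z) ** 2)


lengths = [1, 2, 3, 3]  # hypothetical toy code {0, 10, 110, 111}
T, h = 0.7, 1e-6
# Central difference approximation of the derivative E_n'(T).
numeric = (energy(lengths, T + h) - energy(lengths, T - h)) / (2 * h)
closed = specific_heat(lengths, T)
```

The bracket in the closed form is the variance of the codeword length under the normalized weights, which is why $C_n(T)$ is nonnegative.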

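Next, we show that $C(T)$ is a left-computable real number. Let $p_1, p_2, p_3, \dots$ be a recursive
enumeration of the r.e. set $\operatorname{dom} U$. For each $m \in \mathbb{N}^+$, we define $\tilde{Y}_m(T)$, $\tilde{W}_m(T)$, and $\tilde{Z}_m(T)$ by

$$\tilde{Y}_m(T) = \sum_{i=1}^{m} |p_i|^2\, 2^{-\frac{|p_i|}{T}}, \quad \tilde{W}_m(T) = \sum_{i=1}^{m} |p_i|\, 2^{-\frac{|p_i|}{T}}, \quad\text{and}\quad \tilde{Z}_m(T) = \sum_{i=1}^{m} 2^{-\frac{|p_i|}{T}},$$

respectively. Since $Y_n(T)$, $W_n(T)$, and $Z_n(T)$ are positive term series which converge as $n \to \infty$,
we see that $W(2,T) = \lim_{m\to\infty} \tilde{Y}_m(T)$, $W(1,T) = \lim_{m\to\infty} \tilde{W}_m(T)$, $Z(T) = \lim_{m\to\infty} \tilde{Z}_m(T)$, and
$C(T) = \lim_{m\to\infty} \tilde{C}_m(T)$ where

$$\tilde{C}_m(T) = \frac{\ln 2}{T^2} \left\{ \frac{\tilde{Y}_m(T)}{\tilde{Z}_m(T)} - \left( \frac{\tilde{W}_m(T)}{\tilde{Z}_m(T)} \right)^{2} \right\}.$$

We then see that $\tilde{C}_{m+1}(T) - \tilde{C}_m(T)$ is calculated as

$$\frac{\ln 2}{T^2}\, \frac{2^{-\frac{|p_{m+1}|}{T}}}{\tilde{Z}_{m+1}(T)} \left\{ |p_{m+1}|^2 - \left( \frac{\tilde{W}_{m+1}(T)}{\tilde{Z}_{m+1}(T)} + \frac{\tilde{W}_m(T)}{\tilde{Z}_m(T)} \right) |p_{m+1}| + \left( \frac{\tilde{W}_{m+1}(T)}{\tilde{Z}_{m+1}(T)} + \frac{\tilde{W}_m(T)}{\tilde{Z}_m(T)} \right) \frac{\tilde{W}_m(T)}{\tilde{Z}_m(T)} - \frac{\tilde{Y}_m(T)}{\tilde{Z}_m(T)} \right\}. \tag{23}$$

Since $\tilde{Y}_m(T)$, $\tilde{W}_m(T)$, and $\tilde{Z}_m(T)$ converge as $m \to \infty$ and $\lim_{m\to\infty} |p_m| = \infty$, there exists $m_1 \in \mathbb{N}^+$
such that, for any $m \ge m_1$,

$$\tilde{C}_{m+1}(T) - \tilde{C}_m(T) > 2^{-\frac{|p_{m+1}|}{T}}. \tag{24}$$

In particular, $\tilde{C}_m(T)$ is an increasing function of $m$ for all $m \ge m_1$ by the above inequality. Thus,
since $T$ is a computable real number, $C(T)$ is shown to be a left-computable real number.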

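Based on the computability of $T$ and the inequality (24), in the same manner as the proof of
Theorem 4.6 (i) we can show that $C(T)$ is Chaitin $T$-random.

Next, we show that $C(T)$ is $T$-compressible. Since $T$ is a computable real number, there exists
a total recursive function $g\colon \mathbb{N}^+ \times \mathbb{N}^+ \to \mathbb{Z}$ such that, for all $m, n \in \mathbb{N}^+$,

$$\left| \tilde{C}_m(T) - \lfloor C(T) \rfloor - 2^{-n} g(m,n) \right| < 2^{-n}. \tag{25}$$

It is also shown that there is $c \in \mathbb{N}^+$ such that, for all $m \in \mathbb{N}^+$,

$$\left| C(T) - \tilde{C}_m(T) \right| < 2^{c} \max\left\{ \left| W(2,T) - \tilde{Y}_m(T) \right|,\ \left| W(1,T) - \tilde{W}_m(T) \right|,\ \left| Z(T) - \tilde{Z}_m(T) \right| \right\}. \tag{26}$$

Let $t$ be any computable real number with $T < t < 1$. Then, $W(2,t) = \lim_{m\to\infty} \tilde{Y}_m(t)$, where

$$\tilde{Y}_m(t) = \sum_{i=1}^{m} |p_i|^2\, 2^{-\frac{|p_i|}{t}}.$$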
The limit value $W(2,t)$ exists since $t < 1$. Let $\beta$ be the infinite binary string such that $0.\beta$ is the
base-two expansion of $W(2,t) \bmod' 1$ with infinitely many ones.

Given $n$ and $\beta_{\lceil Tn/t \rceil}$ (i.e., the first $\lceil Tn/t \rceil$ bits of $\beta$), one can find $m_0 \in \mathbb{N}^+$ such that

$$\lceil W(2,t) \rceil - 1 + 0.\beta_{\lceil Tn/t \rceil} < \tilde{Y}_{m_0}(t).$$

This is possible since $t$ is a computable real number and $\lceil W(2,t) \rceil - 1 + 0.\beta_{\lceil Tn/t \rceil} < W(2,t)$. It is
then easy to see that

$$\sum_{i=m_0+1}^{\infty} |p_i|^2\, 2^{-\frac{|p_i|}{t}} < 2^{-Tn/t}.$$

Raising both sides of this inequality to the power $t/T$ and using the inequality $a^c + b^c \le (a + b)^c$
for real numbers $a, b > 0$ and $c \ge 1$, we have

$$2^{-n} > \sum_{i=m_0+1}^{\infty} |p_i|^{\frac{2t}{T}}\, 2^{-\frac{|p_i|}{T}} \ge \sum_{i=m_0+1}^{\infty} |p_i|^2\, 2^{-\frac{|p_i|}{T}}$$

and therefore

$$2^{-n} > \sum_{i=m_0+1}^{\infty} |p_i|\, 2^{-\frac{|p_i|}{T}} \quad\text{and}\quad 2^{-n} > \sum_{i=m_0+1}^{\infty} 2^{-\frac{|p_i|}{T}}.$$

It follows that

$$\max\left\{ \left| W(2,T) - \tilde{Y}_{m_0}(T) \right|,\ \left| W(1,T) - \tilde{W}_{m_0}(T) \right|,\ \left| Z(T) - \tilde{Z}_{m_0}(T) \right| \right\} < 2^{-n}. \tag{27}$$

From (25), (26), (27), and

$$\left| \lfloor C(T) \rfloor + 0.(C(T))_n - C(T) \right| < 2^{-n},$$

it is shown that

$$\left| (C(T))_n - g(m_0, n) \right| < 2^{c} + 2$$

and therefore

$$(C(T))_n = g(m_0,n),\ g(m_0,n) \pm 1,\ g(m_0,n) \pm 2,\ \dots,\ g(m_0,n) \pm (2^{c} + 1),$$

where $(C(T))_n$ is regarded as a dyadic integer. Thus, there are still $2^{c+1} + 3$ possibilities of $(C(T))_n$,
so that one needs only $c + 2$ bits more in order to determine $(C(T))_n$.

Thus, there exists a partial recursive function $\Phi\colon \mathbb{N}^+ \times \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall n \in \mathbb{N}^+ \ \exists s \in \{0,1\}^* \quad |s| = c + 2 \ \&\ \Phi(n, \beta_{\lceil Tn/t \rceil}, s) = (C(T))_n.$$

It follows from (3) that $H((C(T))_n) \le |\beta_{\lceil Tn/t \rceil}| + o(n) \le Tn/t + o(n)$, which implies that $C(T)$ is
$T/t$-compressible. Since $t$ is any computable real number with $T < t < 1$, it follows that $C(T)$ is
$T$-compressible.

By evaluating $C_{n+1}(x) - C_n(x)$ for all $x \in (0,1)$ like (23), we can show that $C_n(x)$ converges
uniformly in the wider sense on $(0,1)$ to $C(x)$ as $n \to \infty$. Hence, we have $C(T) = E'(T)$.
(ii) It can be shown that

$$C_n(1) = \frac{\ln 2}{2 Z_n(1)^2} \sum_{i=1}^{n} \sum_{j=1}^{n} \left( |q_i| - |q_j| \right)^2 2^{-|q_i|}\, 2^{-|q_j|}.$$

By Theorem 3.1 (ii), $\sum_{j=1}^{n} (|q_1| - |q_j|)^2\, 2^{-|q_j|}$ diverges to $\infty$ as $n \to \infty$. Therefore $\sum_{i=1}^{n} \sum_{j=1}^{n} (|q_i| -
|q_j|)^2\, 2^{-|q_i|}\, 2^{-|q_j|}$ also diverges to $\infty$ as $n \to \infty$. On the other hand, by Theorem 4.2 (i), $Z_n(1)$
converges as $n \to \infty$. Thus, $C(1)$ diverges to $\infty$, and the proof is completed. □
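For readers who want the intermediate step behind the double-sum expression for $C_n(1)$, the following short derivation expands the square. It assumes the closed form $C_n(T) = \frac{\ln 2}{T^2}\{Y_n(T)/Z_n(T) - (W_n(T)/Z_n(T))^2\}$ with $Y_n$, $W_n$, $Z_n$ the partial sums weighted by $|q_i|^2$, $|q_i|$, and $1$; the abbreviations $a_i$ and $x_i$ are introduced here only for brevity.

```latex
% Abbreviations for this remark: a_i = |q_i|, x_i = 2^{-|q_i|} (the case T = 1).
\begin{align*}
\sum_{i=1}^{n} \sum_{j=1}^{n} (a_i - a_j)^2 x_i x_j
  &= \sum_{i,j} a_i^2 x_i x_j - 2 \sum_{i,j} a_i a_j x_i x_j + \sum_{i,j} a_j^2 x_i x_j \\
  &= 2\, Y_n(1) Z_n(1) - 2\, W_n(1)^2, \\[4pt]
C_n(1) = \ln 2 \cdot \frac{Y_n(1) Z_n(1) - W_n(1)^2}{Z_n(1)^2}
  &= \frac{\ln 2}{2 Z_n(1)^2} \sum_{i=1}^{n} \sum_{j=1}^{n} (a_i - a_j)^2 x_i x_j .
\end{align*}
```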

Thus, the theorems in this section show that the temperature T plays a role as the compression 
rate for all the thermodynamic quantities introduced into algorithmic information theory in this 
section. These theorems also show that the values of the thermodynamic quantities (partition
function, free energy, energy, and statistical mechanical entropy) diverge in the case of $T > 1$. This
phenomenon might be regarded as some sort of phase transition in statistical mechanics.



5 Fixed point theorems on compression rate 

In this section, we prove the following theorem. 

Theorem 5.1 (fixed point theorem on compression rate). For every $T \in (0,1)$, if $Z(T)$ is a
computable real number, then the following hold:

(i) $T$ is right-computable and not left-computable.

(ii) $T$ is weakly Chaitin $T$-random and $T$-compressible.

(iii) $\lim_{n\to\infty} H(T_n)/n = T$. □

Theorem 5.1 follows immediately from Theorem 5.3, Theorem 5.5, and Theorem 5.6 below.
From a purely mathematical point of view, Theorem 5.1 is just a fixed point theorem on compression
rate, where the computability of the value $Z(T)$ gives a sufficient condition for $T$ to be a fixed point
on compression rate. Note that $Z(T)$ is a monotonically increasing continuous function on $(0,1)$.
In fact, [15, 16] showed that $Z(T)$ is a function of class $C^\infty$ on $(0,1)$. Thus, since computable real
numbers are dense in $\mathbb{R}$, we have the following corollary of Theorem 5.1.

Corollary 5.2. The set $\{\, T \in (0,1) \mid \lim_{n\to\infty} H(T_n)/n = T \,\}$ is dense in $[0,1]$. □

From the point of view of the statistical mechanical interpretation introduced in the previous
section, Theorem 5.1 shows that the compression rate of temperature is equal to the temperature
itself. Thus, Theorem 5.1 further confirms the role of temperature as the compression rate, which
is observed in the previous section.

As a first step to prove Theorem 5.1, we prove the following theorem which gives the weak
Chaitin $T$-randomness of $T$ in Theorem 5.1.

Theorem 5.3. For every $T \in (0,1)$, if $Z(T)$ is a right-computable real number, then $T$ is weakly
Chaitin $T$-random.

Footnote: It is still open whether $C(T)$ diverges or not in the case of $T > 1$.
Proof. Let $p_1, p_2, p_3, \dots$ be a recursive enumeration of the r.e. set $\operatorname{dom} U$. For each $k \in \mathbb{N}^+$, we
define a function $Z_k\colon (0,1) \to \mathbb{R}$ by

$$Z_k(x) = \sum_{i=1}^{k} 2^{-\frac{|p_i|}{x}}.$$

Then, $\lim_{k\to\infty} Z_k(x) = Z(x)$ for every $x \in (0,1)$. On the other hand, since $Z(T)$ is right-computable,
there exists a total recursive function $g\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $Z(T) < g(m)$ for all
$m \in \mathbb{N}^+$, and $\lim_{m\to\infty} g(m) = Z(T)$.

We choose any one real number $t$ with $T < t < 1$. For each $i \in \mathbb{N}^+$, using the mean value
theorem we see that

$$2^{-\frac{|p_i|}{x}} - 2^{-\frac{|p_i|}{T}} < \frac{\ln 2}{T^2}\, |p_i|\, 2^{-\frac{|p_i|}{t}} (x - T)$$

for all $x \in (T, t)$. We choose any one $c \in \mathbb{N}$ with $W(1,t) \ln 2 / T^2 \le 2^{c}$. Here, the limit value $W(1,t)$
exists by Theorem 3.1 (i), since $0 < t < 1$. Then, it follows that

$$Z_k(x) - Z_k(T) < 2^{c} (x - T) \tag{28}$$

for all $k \in \mathbb{N}^+$ and $x \in (T, t)$.

We choose any one $n_0 \in \mathbb{N}^+$ such that $T < 0.T_n + 2^{-n} < t$ for all $n \ge n_0$. Such $n_0$ exists since
$T < t$ and $\lim_{n\to\infty} \left( 0.T_n + 2^{-n} \right) = T$.

Given $T_n$ with $n \ge n_0$, one can find $k_0, m_0 \in \mathbb{N}^+$ such that

$$g(m_0) < Z_{k_0}(0.T_n + 2^{-n}).$$

This is possible from $Z(T) < Z(0.T_n + 2^{-n})$, $\lim_{k\to\infty} Z_k(0.T_n + 2^{-n}) = Z(0.T_n + 2^{-n})$, and the
properties of $g$. It follows from $Z(T) < g(m_0)$ and (28) that

$$\sum_{i=k_0+1}^{\infty} 2^{-\frac{|p_i|}{T}} = Z(T) - Z_{k_0}(T) < Z_{k_0}(0.T_n + 2^{-n}) - Z_{k_0}(T) < 2^{c-n}.$$

Hence, for every $i > k_0$, $2^{-|p_i|/T} < 2^{c-n}$ and therefore $Tn - Tc < |p_i|$. Thus, by calculating the set
$\{\, U(p_i) \mid i \le k_0 \,\}$ and picking any one finite binary string which is not in this set, one can then
obtain an $s \in \{0,1\}^*$ such that $Tn - Tc < H(s)$.

Hence, there exists a partial recursive function $\Psi\colon \{0,1\}^* \to \{0,1\}^*$ such that $Tn - Tc <
H(\Psi(T_n))$ for all $n \ge n_0$. Using (2), there is $c_\Psi \in \mathbb{N}^+$ such that $H(\Psi(T_n)) < H(T_n) + c_\Psi$ for all
$n \ge n_0$. Therefore, $Tn - Tc - c_\Psi < H(T_n)$ for all $n \ge n_0$. It follows that $T$ is weakly Chaitin
$T$-random. □

Remark 5.4. By elaborating Theorem 4.2 (i), we can see that the left-computability of $T$ results
in the weak Chaitin $T$-randomness of $Z(T)$. On the other hand, by Theorem 5.3, the right-computability
of $Z(T)$ results in the weak Chaitin $T$-randomness of $T$. We can integrate these two
extremes into the following form: For every $T \in (0,1]$, there exists $c \in \mathbb{N}^+$ such that, for every
$n \in \mathbb{N}^+$,

$$Tn - c \le H(T_n, (Z(T))_n), \tag{29}$$

where $H(s,t)$ is defined as $H(\langle s,t \rangle)$ with any one computable bijection $\langle s,t \rangle$ from $(s,t) \in
\{0,1\}^* \times \{0,1\}^*$ to $\{0,1\}^*$ (see [?] for the detail of the notion of $H(s,t)$). In fact, if $T$ is left-computable,
then we can show that $H((Z(T))_n) = H(T_n, (Z(T))_n) + O(1)$, and therefore the inequality
(29) results in the weak Chaitin $T$-randomness of $Z(T)$. On the other hand, if $Z(T)$
is right-computable, then we can show that $H(T_n) = H(T_n, (Z(T))_n) + O(1)$, and therefore the
inequality (29) results in the weak Chaitin $T$-randomness of $T$.

Note, however, that the inequality (29) is not necessarily tight except for these two extremes,
that is, the following inequality does not hold: For every $T \in (0,1]$,

$$H(T_n, (Z(T))_n) \le Tn + o(n), \tag{30}$$

where $o(n)$ depends on $T$ in addition to $n$. To see this, contrarily assume that the inequality (30)
holds. Then, by setting $T$ to Chaitin's $\Omega$ we have $H(\Omega_n) \le H(\Omega_n, (Z(\Omega))_n) + O(1) \le \Omega n + o(n)$.
Since $\Omega < 1$, this contradicts the fact that $\Omega$ is weakly Chaitin random. Thus, the inequality (30)
does not hold. □

The following Theorem 5.5 and Theorem 5.6 together give the $T$-compressibility of $T$ in Theorem 5.1.

Theorem 5.5. For every $T \in (0,1)$, if $Z(T)$ is a right-computable real number, then $T$ is also a
right-computable real number.

Proof. Let $p_1, p_2, p_3, \dots$ be a recursive enumeration of the r.e. set $\operatorname{dom} U$. For each $k \in \mathbb{N}^+$, we
define a function $Z_k\colon (0,1) \to \mathbb{R}$ by

$$Z_k(x) = \sum_{i=1}^{k} 2^{-\frac{|p_i|}{x}}.$$

Then, $\lim_{k\to\infty} Z_k(x) = Z(x)$ for every $x \in (0,1)$. Since $Z(T)$ is right-computable, there exists a
total recursive function $g\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $Z(T) < g(m)$ for all $m \in \mathbb{N}^+$, and $\lim_{m\to\infty} g(m) =
Z(T)$. Since $Z(x)$ is an increasing function of $x$, we see that, for every $x \in \mathbb{Q}$ with $0 < x < 1$,
$T < x$ if and only if there are $m, k \in \mathbb{N}^+$ such that $g(m) < Z_k(x)$. Thus, $T$ is right-computable.
This is because the set $\{\, (m,n) \mid m \in \mathbb{Z} \ \&\ n \in \mathbb{N}^+ \ \&\ T < m/n \,\}$ is r.e. if and only if $T$ is
right-computable. □
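The equivalence used in this proof ($T < x$ exactly when some pair $(m, k)$ satisfies $g(m) < Z_k(x)$) amounts to a dovetailed search that halts only on the instances where $T < x$. The sketch below imitates that search on a hypothetical finite prefix-free set with a hidden value of $T$; the names, the toy data, and the finite `budget` (which stands in for an unbounded search) are all assumptions of the example, not the construction in the proof.

```python
def Z_k(lengths, k, x):
    """Partial sum Z_k(x) over the first k codeword lengths."""
    return sum(2.0 ** (-l / x) for l in lengths[:k])


def witness_T_below(x, g, lengths, budget):
    """Search pairs (m, k) with m, k <= budget for g(m) < Z_k(x).

    A returned pair certifies T < x. Returning None only means that no
    witness was found within the budget; the real search would run forever.
    """
    for bound in range(1, budget + 1):
        for m in range(1, bound + 1):
            for k in range(1, min(bound, len(lengths)) + 1):
                if g(m) < Z_k(lengths, k, x):
                    return (m, k)
    return None


# Toy instance: lengths of {0, 10, 110, 111} and a hidden T = 0.6.
toy_lengths = [1, 2, 3, 3]
hidden_Z = Z_k(toy_lengths, len(toy_lengths), 0.6)


def upper_approx(m):
    """Stand-in for g: approximations of Z(T) from above, converging to Z(T)."""
    return hidden_Z + 2.0 ** (-m)


found = witness_T_below(0.8, upper_approx, toy_lengths, budget=8)
not_found = witness_T_below(0.5, upper_approx, toy_lengths, budget=8)
```

For $x = 0.8$ a witness appears after a few rounds, while for $x = 0.5$ every $Z_k(x)$ stays below $Z(T)$, so no witness can ever exist.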

The converse of Theorem 5.5 does not hold. To see this, consider an arbitrary computable
real number $T \in (0,1)$. Then, obviously $T$ is right-computable. On the other hand, $Z(T)$ is left-computable
and weakly Chaitin $T$-random by Theorem 4.2 (i). Thus, $Z(T)$ is not right-computable.

Theorem 5.6. For every $T \in (0,1)$, if $Z(T)$ is a left-computable real number and $T$ is a right-computable
real number, then $T$ is $T$-compressible.

Proof. Let $p_1, p_2, p_3, \dots$ be a recursive enumeration of the r.e. set $\operatorname{dom} U$. For each $k \in \mathbb{N}^+$, we
define a function $Z_k\colon (0,1) \to \mathbb{R}$ by

$$Z_k(x) = \sum_{i=1}^{k} 2^{-\frac{|p_i|}{x}}.$$

Then, $\lim_{k\to\infty} Z_k(x) = Z(x)$ for every $x \in (0,1)$.

For each $i \in \mathbb{N}^+$, using the mean value theorem we see that

$$2^{-\frac{|p_i|}{t}} - 2^{-\frac{|p_i|}{T}} > (\ln 2)\, |p_i|\, 2^{-\frac{|p_i|}{T}} (t - T)$$

for all $t \in (T, 1)$. We choose any one $c \in \mathbb{N}^+$ such that $(\ln 2)\, |p_1|\, 2^{-|p_1|/T} \ge 2^{-c}$. Then, it follows that

$$Z_k(t) - Z_k(T) > 2^{-c} (t - T) \tag{31}$$

for all $k \in \mathbb{N}^+$ and $t \in (T, 1)$.

Since $T$ is a right-computable real number with $T < 1$, there exists a total recursive function
$f\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $T < f(l) < 1$ for all $l \in \mathbb{N}^+$, and $\lim_{l\to\infty} f(l) = T$. On the other hand, since
$Z(T)$ is left-computable, there exists a total recursive function $g\colon \mathbb{N}^+ \to \mathbb{Q}$ such that $g(m) < Z(T)$
for all $m \in \mathbb{N}^+$, and $\lim_{m\to\infty} g(m) = Z(T)$. Let $\beta$ be the infinite binary sequence such that $0.\beta$ is
the base-two expansion of $Z(1)$ (i.e., Chaitin's $\Omega$).

Given $n$ and $\beta_{\lceil Tn \rceil}$ (i.e., the first $\lceil Tn \rceil$ bits of $\beta$), one can find $k_0 \in \mathbb{N}^+$ such that

$$0.\beta_{\lceil Tn \rceil} < \sum_{i=1}^{k_0} 2^{-|p_i|}.$$

It is then easy to see that

$$\sum_{i=k_0+1}^{\infty} 2^{-|p_i|} < 2^{-Tn}.$$

Using the inequality $a^d + b^d \le (a + b)^d$ for real numbers $a, b > 0$ and $d \ge 1$, it follows that

$$Z(T) - Z_{k_0}(T) < 2^{-n}. \tag{32}$$

Note that $Z_{k_0}(T) < Z_{k_0}(f(l))$ for all $l \in \mathbb{N}^+$, and $\lim_{l\to\infty} Z_{k_0}(f(l)) = Z_{k_0}(T)$. Thus, since $Z_{k_0}(T) <
Z(T)$, one can then find $l_0, m_0 \in \mathbb{N}^+$ such that

$$Z_{k_0}(f(l_0)) < g(m_0).$$

It follows from (32) and (31) that

$$2^{-n} > g(m_0) - Z_{k_0}(T) > Z_{k_0}(f(l_0)) - Z_{k_0}(T) > 2^{-c} \left( f(l_0) - T \right).$$

Thus, $0 < f(l_0) - T < 2^{c-n}$. Let $t_n$ be the first $n$ bits of the base-two expansion of the rational
number $f(l_0)$ with infinitely many zeros. Then, $| f(l_0) - 0.t_n | < 2^{-n}$. It follows from $| T - 0.T_n | <
2^{-n}$ that $| 0.T_n - 0.t_n | < (2^{c} + 2) 2^{-n}$. Hence

$$T_n = t_n,\ t_n \pm 1,\ t_n \pm 2,\ \dots,\ t_n \pm (2^{c} + 1),$$

where $T_n$ and $t_n$ are regarded as dyadic integers. Thus, there are still $2^{c+1} + 3$ possibilities of $T_n$,
so that one needs only $c + 2$ bits more in order to determine $T_n$.

Thus, there exists a partial recursive function $\Phi\colon \mathbb{N}^+ \times \{0,1\}^* \times \{0,1\}^* \to \{0,1\}^*$ such that

$$\forall n \in \mathbb{N}^+ \ \exists s \in \{0,1\}^* \quad |s| = c + 2 \ \&\ \Phi(n, \beta_{\lceil Tn \rceil}, s) = T_n.$$

It follows from (3) that $H(T_n) \le |\beta_{\lceil Tn \rceil}| + o(n) \le Tn + o(n)$, which implies that $T$ is $T$-compressible. □
In a similar manner to the proof of Theorem 5.1, we can prove another version of a fixed point
theorem on compression rate as follows. Here, the weak Chaitin $T$-randomness is replaced by the
Chaitin $T$-randomness.

Theorem 5.7 (fixed point theorem on compression rate II). Let $Q$ be a computable real number
with $Q > 0$. For every $T \in (0,1)$, if $W(Q,T)$ is a computable real number, then the following hold:

(i) $T$ is right-computable and not left-computable.

(ii) $T$ is Chaitin $T$-random and $T$-compressible. □

Remark 5.8. The computability of $Z(T)$ in the premise of Theorem 5.1 can be replaced by the
computability of $F(T)$. On the other hand, the computability of $W(Q,T)$ in the premise of Theorem 5.7
can be replaced by the computability of $E(T)$ or $S(T)$. □



6 Total statistical mechanical interpretation of algorithmic information theory: Physical and informal argument

In what follows, based on a physical argument we develop a total statistical mechanical interpreta- 
tion of algorithmic information theory which attains a perfect correspondence to normal statistical 
mechanics. In consequence, we justify the interpretation of as a partition function and clarify 
the statistical mechanical meaning of the thermodynamic quantities introduced into algorithmic 
information theory in Section 4. In the work [17], we developed a statistical mechanical
interpretation of the noiseless source coding scheme based on an absolutely optimal instantaneous code
by identifying a microcanonical ensemble in the scheme. In a similar manner to [17], we develop a
statistical mechanical interpretation of algorithmic information theory in what follows. This is
possible because the set $\operatorname{dom} U$ is prefix-free and therefore the action of the optimal computer
$U$ can be regarded as an instantaneous code which is extended over an infinite set. Note that,
in what follows, we do not stick to the mathematical strictness of the argument and we make an 
argument on the same level of mathematical strictness as statistical mechanics in physics. We start 
with some reviews of statistical mechanics. 

In statistical mechanics we consider a quantum system $S_{\mathrm{total}}$ which consists of a large number of
identical quantum subsystems. Let N be the number of such subsystems. For example, ~ 10^'^ 
for 1 cm$^3$ of a gas at room temperature. We assume here that each quantum subsystem can be
distinguished from the others. Thus, we deal with quantum particles which obey Maxwell-Boltzmann
statistics and not Bose-Einstein statistics or Fermi-Dirac statistics. Under this assumption, we can
identify the $i$th quantum subsystem $S_i$ for each $i = 1, \dots, N$. In quantum mechanics, any quantum
system is described completely by a quantum state. In statistical mechanics, among all quantum
states, energy eigenstates are of particular importance. Any energy eigenstate of each subsystem $S_i$
can be specified by a number $n = 1, 2, 3, \dots$, called a quantum number, where the subsystem in the
energy eigenstate specified by $n$ has the energy $E_n$. Then, any energy eigenstate of the system $S_{\mathrm{total}}$
can be specified by an $N$-tuple $(n_1, n_2, \dots, n_N)$ of quantum numbers. If the state of the system
$S_{\mathrm{total}}$ is the energy eigenstate specified by $(n_1, n_2, \dots, n_N)$, then the state of each subsystem $S_i$ is
the energy eigenstate specified by $n_i$ and the system $S_{\mathrm{total}}$ has the energy $E_{n_1} + E_{n_2} + \cdots + E_{n_N}$.
Then, the fundamental postulate of statistical mechanics, called the principle of equal probability,
is stated as follows.

The Principle of Equal Probability: If the energy of the system $S_{\mathrm{total}}$ is known to have a
constant value in the range between $E$ and $E + \delta E$, where $\delta E$ is the indeterminacy in measurement
of the energy of the system $S_{\mathrm{total}}$, then the system $S_{\mathrm{total}}$ is equally likely to be in any energy
eigenstate specified by $(n_1, n_2, \dots, n_N)$ such that $E \le E_{n_1} + E_{n_2} + \cdots + E_{n_N} \le E + \delta E$. □

Let $\Theta(E, N)$ be the total number of energy eigenstates of $S_{\mathrm{total}}$ specified by $(n_1, n_2, \dots, n_N)$ such
that $E \le E_{n_1} + E_{n_2} + \cdots + E_{n_N} \le E + \delta E$. The above postulate states that any energy eigenstate
of $S_{\mathrm{total}}$ whose energy lies between $E$ and $E + \delta E$ occurs with the probability $1/\Theta(E, N)$. This
uniform distribution of energy eigenstates whose energies lie between $E$ and $E + \delta E$ is called a
microcanonical ensemble. In statistical mechanics, the entropy $S(E, N)$ of the system $S_{\mathrm{total}}$ is then
defined by

$$S(E, N) = k \ln \Theta(E, N),$$

where $k$ is a positive constant, called the Boltzmann constant, and $\ln$ denotes the natural
logarithm. The average energy $\varepsilon$ per one subsystem is given by $E/N$. In a normal case where $\varepsilon$ has
a finite value, the entropy $S(E, N)$ is proportional to $N$. On the other hand, the indeterminacy
$\delta E$ of the energy contributes to $S(E, N)$ through the term $k \ln \delta E$, which can be ignored compared
to $N$ unless $\delta E$ is too small. Thus the magnitude of the indeterminacy $\delta E$ of the energy does not
matter to the value of the entropy $S(E, N)$ unless it is too small. The temperature $T(E, N)$ of the
system $S_{\mathrm{total}}$ is defined by

$$\frac{1}{T(E, N)} = \frac{\partial S}{\partial E}(E, N).$$

Thus the temperature is a function of $E$ and $N$.

Now we give a statistical mechanical interpretation to algorithmic information theory. As
considered in [3], think of the optimal computer $U$ as decoding equipment at the receiving end
of a noiseless binary communication channel. Regard its programs (i.e., finite binary strings in
$\operatorname{dom} U$) as codewords and regard the result of the computation by $U$, which is a finite binary
string, as a decoded "symbol." Since $\operatorname{dom} U$ is a prefix-free set, such codewords form what is
called an "instantaneous code," so that successive symbols sent through the channel in the form of
concatenation of codewords can be separated.

For establishing the statistical mechanical interpretation of algorithmic information theory, we
assume that the infinite binary string sent through the channel is generated by infinitely repeated
tosses of a fair coin. Under this assumption, the success probability of decoding one symbol is equal
to Chaitin's halting probability $\Omega$, and the probability of getting a finite binary string $s$ as the first
decoded symbol is equal to $P(s)$. Hereafter the infinite binary string sent through the channel is
referred to as the channel infinite string. For each $r \in \{0,1\}^*$, let $Q(r)$ be the probability that the
channel infinite string has the prefix $r$. It follows that $Q(r) = 2^{-|r|}$. Thus, the channel infinite
string is the random variable drawn according to Lebesgue measure on $\{0,1\}^\infty$.

Let be a large number, say N ~ W^^. We relate algorithmic information theory to the 
statistical mechanics reviewed above in the following manner. Among all infinite binary strings, 
consider infinite binary strings of the form $p_1 p_2 \cdots p_N \alpha$ with $p_1, p_2, \dots, p_N \in \operatorname{dom} U$ and $\alpha \in
\{0,1\}^\infty$. For each $i$, the $i$th slot fed by $p_i$ corresponds to the $i$th quantum subsystem $S_i$. On the
other hand, the ordered sequence of the 1st slot, the 2nd slot, ..., and the $N$th slot corresponds to
the quantum system $S_{\mathrm{total}}$. We relate a codeword $p \in \operatorname{dom} U$ to an energy eigenstate of a subsystem,
and relate a codeword length $|p|$ to an energy $E_n$ of the energy eigenstate of the subsystem. Then, a
finite binary string $p_1 \cdots p_N$ corresponds to an energy eigenstate of $S_{\mathrm{total}}$ specified by $(n_1, \dots, n_N)$.
Thus, $|p_1| + \cdots + |p_N| = |p_1 \cdots p_N|$ corresponds to the energy $E_{n_1} + \cdots + E_{n_N}$ of the energy eigenstate
of $S_{\mathrm{total}}$.

We define a subset $C(L,N)$ of $\{0,1\}^*$ as the set of all finite binary strings of the form $p_1 \cdots p_N$ 
with $p_i \in \mathrm{dom}\,U$ whose total length $|p_1 \cdots p_N|$ lies between $L$ and $L + \delta L$. Then, $\Theta(L,N)$ is defined 
as the cardinality of $C(L,N)$. Therefore, $\Theta(L,N)$ is the total number of all concatenations of $N$ 
codewords whose total length lies between $L$ and $L + \delta L$. We can see that if $p_1 \cdots p_N \in C(L,N)$, 
then $2^{-(L+\delta L)} \le Q(p_1 \cdots p_N) \le 2^{-L}$. Thus, all concatenations $p_1 \cdots p_N \in C(L,N)$ of codewords 
occur in a prefix of the channel infinite string with the same probability $2^{-L}$. Note here that we 
care nothing about the magnitude of $\delta L$, as in the case of statistical mechanics. Thus, the following 
principle, called the principle of equal conditional probability, holds. 

The Principle of Equal Conditional Probability: Given that a concatenation of $N$ codewords 
of total length $L$ occurs in a prefix of the channel infinite string, all such concatenations occur with 
the same probability $1/\Theta(L,N)$. □ 
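
The principle can be checked by brute force on a small case. The sketch below is only an illustration under stated assumptions: `TOY_DOMAIN` is a hypothetical finite prefix-free set standing in for $\mathrm{dom}\,U$, and the window $\delta L$ is replaced by the requirement that the total length is exactly $L$.

```python
from fractions import Fraction
from itertools import product

# Hypothetical finite stand-in for dom U (prefix-free by construction).
TOY_DOMAIN = ("0", "10", "110")


def concatenations(L, N, words=TOY_DOMAIN):
    """All concatenations p_1...p_N of N codewords with total length exactly L.

    Prefix-freeness guarantees that distinct N-tuples give distinct strings.
    """
    return ["".join(t) for t in product(words, repeat=N) if sum(map(len, t)) == L]


def conditional_probabilities(L, N, words=TOY_DOMAIN):
    """Probability of each concatenation, given that some concatenation of N
    codewords of total length L is a prefix of the channel infinite string."""
    strings = concatenations(L, N, words)
    # Q(r) = 2^{-|r|} is the probability that a fair-coin stream starts with r.
    q = {s: Fraction(1, 2 ** len(s)) for s in strings}
    total = sum(q.values())
    return {s: prob / total for s, prob in q.items()}
```

For instance, `conditional_probabilities(5, 3)` assigns the same conditional probability to every one of the concatenations counted by the toy analogue of $\Theta(5, 3)$.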

We introduce a microcanonical ensemble into algorithmic information theory in this manner. 
Thus, we can develop a certain sort of statistical mechanics on algorithmic information theory. Note 
that, in statistical mechanics, the principle of equal probability is just a conjecture which has not yet 
been proved completely for a realistic physical system. On the other hand, in our statistical mechanical 
interpretation of algorithmic information theory, the principle of equal conditional probability is 
automatically satisfied. 

The statistical mechanical entropy $S(L,N)$ is defined by 

$$S(L,N) = \log_2 \Theta(L,N). \tag{33}$$

The temperature $T(L,N)$ is then defined by 

$$\frac{1}{T(L,N)} = \frac{\partial S}{\partial L}(L,N). \tag{34}$$

Thus, the temperature is a function of $L$ and $N$. 
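
Definitions (33) and (34) can be evaluated numerically on a toy example. The following is a rough sketch, not part of the original argument: it assumes hypothetical codeword lengths `TOY_LENGTHS` (those of the toy set {0, 10, 110}), counts concatenations of exactly total length $L$ in place of the window $\delta L$, and replaces the partial derivative in (34) by a central finite difference.

```python
import math

# Codeword lengths of a hypothetical toy prefix-free set {"0", "10", "110"}.
TOY_LENGTHS = (1, 2, 3)


def theta(L, N, lengths=TOY_LENGTHS):
    """Number of N-tuples of codewords whose total length is exactly L."""
    counts = [1]  # counts[l]: number of ways to reach total length l so far
    for _ in range(N):
        new_counts = [0] * (len(counts) + max(lengths))
        for total, ways in enumerate(counts):
            for k in lengths:
                new_counts[total + k] += ways
        counts = new_counts
    return counts[L] if 0 <= L < len(counts) else 0


def entropy(L, N):
    """S(L, N) = log2 Theta(L, N); raises ValueError if the count is zero."""
    return math.log2(theta(L, N))


def temperature(L, N):
    """T(L, N) from 1/T = dS/dL, approximated by a central difference in L."""
    dS_dL = (entropy(L + 1, N) - entropy(L - 1, N)) / 2
    return 1 / dS_dL
```

Because the toy count is symmetric about $L = 2N$, this sketch yields a positive temperature only for $N < L < 2N$.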

According to the theoretical development of equilibrium statistical mechanics,¹ we can introduce 
a canonical ensemble into algorithmic information theory in the following manner. We investigate 
the probability distribution of the left-most codeword $p_1$ of the channel infinite string, given that 
a concatenation of $N$ codewords of total length $L$ occurs in a prefix of the channel infinite string. 
For each $p \in \{0,1\}^*$, let $R(p)$ be the probability that the left-most codeword of the channel infinite 
string is $p$, given that a concatenation of $N$ codewords of total length $L$ occurs in a prefix of the 
channel infinite string. Based on the principle of equal conditional probability, it can be shown 
that 

$$R(p) = \frac{\Theta(L - |p|, N - 1)}{\Theta(L, N)}.$$

From the general definition (33) of statistical mechanical entropy, we have 

$$R(p) = 2^{S(L - |p|, N - 1) - S(L, N)}. \tag{35}$$
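
The ratio of counts that defines $R(p)$ can be computed exactly on a toy example. This is a minimal sketch under the same kind of assumptions as the text's informal argument: `TOY_DOMAIN` is a hypothetical finite prefix-free set standing in for $\mathrm{dom}\,U$, and total length exactly $L$ replaces the window $\delta L$.

```python
from fractions import Fraction

# Hypothetical finite stand-in for dom U.
TOY_DOMAIN = ("0", "10", "110")


def theta(L, N, words=TOY_DOMAIN):
    """Number of concatenations of N codewords with total length exactly L."""
    counts = {0: 1}  # total length -> number of ways to reach it
    for _ in range(N):
        new_counts = {}
        for total, ways in counts.items():
            for p in words:
                key = total + len(p)
                new_counts[key] = new_counts.get(key, 0) + ways
        counts = new_counts
    return counts.get(L, 0)


def leftmost_distribution(L, N, words=TOY_DOMAIN):
    """R(p) = Theta(L - |p|, N - 1) / Theta(L, N) for each codeword p."""
    denominator = theta(L, N, words)
    return {p: Fraction(theta(L - len(p), N - 1, words), denominator) for p in words}
```

The values sum to 1 exactly, since fixing the left-most codeword partitions the set of all concatenations counted by the toy $\Theta(L, N)$.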



¹ We follow the argument of Section 16-1 of Callen [1] in particular. 

Let $E(L,N)$ be the expected length of the left-most codeword of the channel infinite string, 
given that a concatenation of $N$ codewords of total length $L$ occurs in a prefix of the channel infinite 
string. Then, the following equality is expected to hold: 

$$S(L,N) = S(E(L,N), 1) + S(L - E(L,N), N - 1). \tag{36}$$

Here, the term $S(L,N)$ in the left-hand side denotes the statistical mechanical entropy of the whole 
concatenation of $N$ codewords of total length $L$. On the other hand, the first term $S(E(L,N), 1)$ 
in the right-hand side denotes the statistical mechanical entropy of the left-most codeword of the 
concatenation of $N$ codewords of total length $L$, while the second term $S(L - E(L,N), N - 1)$ in the 
right-hand side denotes the statistical mechanical entropy of the remaining $N - 1$ codewords of the 
concatenation of $N$ codewords of total length $L$. Thus, the equality (36) represents the additivity 
of the statistical mechanical entropy. We assume here that the equality (36) holds. 

By expanding $S(L - |p|, N - 1)$ around the equilibrium point $L - E(L,N)$, we have 

$$\begin{aligned}
S(L - |p|, N - 1) &= S(L - E(L,N) + E(L,N) - |p|, N - 1) \\
&= S(L - E(L,N), N - 1) + \frac{\partial S}{\partial L}(L - E(L,N), N - 1)\,(E(L,N) - |p|).
\end{aligned} \tag{37}$$

Here, we ignore the terms of higher than first order. Since $N \gg 1$ and $L \gg E(L,N)$, using 
the definition (34) of temperature we have 

$$\frac{\partial S}{\partial L}(L - E(L,N), N - 1) = \frac{\partial S}{\partial L}(L, N) = \frac{1}{T(L,N)}. \tag{38}$$

Hence, by (37) and (38), we have 

$$S(L - |p|, N - 1) = S(L - E(L,N), N - 1) + \frac{1}{T(L,N)}\,(E(L,N) - |p|). \tag{39}$$

Thus, using (35), (36), and (39), we obtain 

$$R(p) = 2^{\frac{E(L,N) - T(L,N)\,S(E(L,N),1)}{T(L,N)}}\; 2^{-\frac{|p|}{T(L,N)}}.$$

Then, according to statistical mechanics we define the free energy $F(L,N)$ of the left-most codeword 
of the concatenation of $N$ codewords of total length $L$ by 

$$F(L,N) = E(L,N) - T(L,N)\,S(E(L,N), 1). \tag{40}$$

It follows that 

$$R(p) = 2^{\frac{F(L,N)}{T(L,N)}}\; 2^{-\frac{|p|}{T(L,N)}}. \tag{41}$$

Using $\sum_{p \in \mathrm{dom}\,U} R(p) = 1$, we can show that, for any $p \in \mathrm{dom}\,U$, 

$$R(p) = \frac{1}{Z(T(L,N))}\, 2^{-\frac{|p|}{T(L,N)}}, \tag{42}$$

where $Z(T)$ is defined by 

$$Z(T) = \sum_{p \in \mathrm{dom}\,U} 2^{-\frac{|p|}{T}} \quad (T > 0). \tag{43}$$

$Z(T)$ is called the partition function (of the left-most codeword of the channel infinite string). 
Thus, in our statistical mechanical interpretation of algorithmic information theory, the partition 
function $Z(T)$ has exactly the same form as $\Omega^D$. The distribution in the form of $R(p)$ is called a 
canonical ensemble in statistical mechanics. 
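
The partition function (43) and the canonical distribution can be written down directly for a finite example. The sketch below assumes a hypothetical finite prefix-free set `TOY_DOMAIN` in place of $\mathrm{dom}\,U$, so the infinite sum in (43) becomes a finite one. The function names are illustrative.

```python
# Hypothetical finite stand-in for dom U.
TOY_DOMAIN = ("0", "10", "110")


def partition_function(T, words=TOY_DOMAIN):
    """Z(T) = sum over codewords p of 2^(-|p|/T), for T > 0."""
    if T <= 0:
        raise ValueError("T must be positive")
    return sum(2.0 ** (-len(p) / T) for p in words)


def canonical_distribution(T, words=TOY_DOMAIN):
    """R(p) = 2^(-|p|/T) / Z(T) for each codeword p."""
    Z = partition_function(T, words)
    return {p: 2.0 ** (-len(p) / T) / Z for p in words}
```

At $T = 1$ the toy value is $Z(1) = 2^{-1} + 2^{-2} + 2^{-3} = 0.875$, which is the toy analogue of the halting probability. Lowering $T$ concentrates $R(p)$ on shorter codewords.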

Then, using (41) and (42), $F(L,N)$ is calculated as 

$$F(L,N) = F(T(L,N)), \tag{44}$$

where $F(T)$ is defined by 

$$F(T) = -T \log_2 Z(T) \quad (T > 0). \tag{45}$$

On the other hand, from the definition of $R(p)$, $E(L,N)$ is calculated as 

$$E(L,N) = \sum_{p \in \mathrm{dom}\,U} |p|\, R(p).$$

Thus, we have 

$$E(L,N) = E(T(L,N)), \tag{46}$$

where $E(T)$ is defined by 

$$E(T) = \frac{1}{Z(T)} \sum_{p \in \mathrm{dom}\,U} |p|\, 2^{-\frac{|p|}{T}} \quad (T > 0). \tag{47}$$

Then, using (40), (44), (45), and (46), the statistical mechanical entropy $S(E(L,N), 1)$ of the 
left-most codeword of the concatenation of $N$ codewords of total length $L$ is calculated as 

$$S(E(L,N), 1) = S(T(L,N)),$$

where $S(T)$ is defined by 

$$S(T) = \frac{1}{T} E(T) + \log_2 Z(T) \quad (T > 0). \tag{48}$$

Note that the statistical mechanical entropy $S(E(L,N), 1)$ coincides with the Shannon entropy 

$$-\sum_{p \in \mathrm{dom}\,U} R(p) \log_2 R(p)$$

of the distribution $R(p)$. 
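
The coincidence with the Shannon entropy, and the relation $F = E - TS$ behind (40), can be checked numerically. This is a minimal sketch assuming a hypothetical finite prefix-free set `TOY_DOMAIN` in place of $\mathrm{dom}\,U$. The formulas are (43), (45), (47), and (48) restricted to that finite set.

```python
import math

# Hypothetical finite stand-in for dom U.
TOY_DOMAIN = ("0", "10", "110")


def partition_function(T, words=TOY_DOMAIN):
    """Z(T), equation (43), restricted to the toy set."""
    return sum(2.0 ** (-len(p) / T) for p in words)


def free_energy(T, words=TOY_DOMAIN):
    """F(T) = -T log2 Z(T), equation (45)."""
    return -T * math.log2(partition_function(T, words))


def expected_length(T, words=TOY_DOMAIN):
    """E(T) = (1/Z(T)) * sum |p| 2^(-|p|/T), equation (47)."""
    Z = partition_function(T, words)
    return sum(len(p) * 2.0 ** (-len(p) / T) for p in words) / Z


def entropy(T, words=TOY_DOMAIN):
    """S(T) = E(T)/T + log2 Z(T), equation (48)."""
    return expected_length(T, words) / T + math.log2(partition_function(T, words))


def shannon_entropy(T, words=TOY_DOMAIN):
    """Shannon entropy of the canonical distribution R(p) = 2^(-|p|/T) / Z(T)."""
    Z = partition_function(T, words)
    probs = [2.0 ** (-len(p) / T) / Z for p in words]
    return -sum(r * math.log2(r) for r in probs)
```

The two entropies agree because $\log_2 R(p) = -|p|/T - \log_2 Z(T)$, so averaging $-\log_2 R(p)$ over $R$ gives $E(T)/T + \log_2 Z(T)$.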

Finally, the specific heat $C(T)$ of the left-most codeword of the channel infinite string is defined 
by 

$$C(T) = E'(T) \quad (T > 0), \tag{49}$$

where $E'(T)$ is the derivative of $E(T)$. 
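
As a sanity check on (49), the derivative of $E(T)$ can be approximated numerically and compared with a fluctuation formula. The formula $C(T) = (\ln 2 / T^2)\,\mathrm{Var}(|p|)$ used below is not stated in the text; it follows from differentiating (47) term by term and is included here as an assumption-labeled cross-check. `TOY_DOMAIN` is again a hypothetical finite stand-in for $\mathrm{dom}\,U$.

```python
import math

# Hypothetical finite stand-in for dom U.
TOY_DOMAIN = ("0", "10", "110")


def expected_length(T, words=TOY_DOMAIN):
    """E(T), equation (47), restricted to the toy set."""
    weights = [2.0 ** (-len(p) / T) for p in words]
    return sum(len(p) * w for p, w in zip(words, weights)) / sum(weights)


def specific_heat_numeric(T, words=TOY_DOMAIN, h=1e-5):
    """C(T) = E'(T), approximated by a central finite difference."""
    return (expected_length(T + h, words) - expected_length(T - h, words)) / (2 * h)


def specific_heat_fluctuation(T, words=TOY_DOMAIN):
    """(ln 2 / T^2) * Var(|p|) under the canonical distribution (derived, not from the text)."""
    weights = [2.0 ** (-len(p) / T) for p in words]
    Z = sum(weights)
    mean = sum(len(p) * w for p, w in zip(words, weights)) / Z
    mean_sq = sum(len(p) ** 2 * w for p, w in zip(words, weights)) / Z
    return math.log(2) / T ** 2 * (mean_sq - mean ** 2)
```

On the toy set the two functions agree up to discretization error, and both are non-negative because a variance cannot be negative.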

Thus, a statistical mechanical interpretation of algorithmic information theory can be established, 
based on a physical argument. We can check that the formulas in this argument, namely the 
partition function (43), the free energy (45), the expected length of the left-most codeword (47), 
the statistical mechanical entropy (48), and the specific heat (49), correspond to Definition 4.1, 
Definition 4.3, Definition 4.5, Definition 4.7, and Definition 4.9 in Section 4, respectively. 
Thus, the statistical mechanical meaning of the notion of thermodynamic quantities 
introduced in Section 4 into algorithmic information theory is clarified by this argument. 

7 Concluding remarks 

In this paper, we have developed a statistical mechanical interpretation of algorithmic information 
theory by introducing the notion of thermodynamic quantities into algorithmic information theory 
and investigating their properties from the point of view of algorithmic randomness. As a result, 
we have discovered that, in the interpretation, the temperature plays a role as the compression rate 
of the values of all these thermodynamic quantities, which include the temperature itself. Thus, in 
particular, we have obtained fixed point theorems on compression rate, which reflect this self-referential 
nature of the compression rate of the temperature. In the last part of this paper, we have also developed 
a total statistical mechanical interpretation of algorithmic information theory, which realizes a 
perfect correspondence to normal statistical mechanics and motivates the above introduction of the 
thermodynamic quantities into algorithmic information theory. However, the argument used in 
the total statistical mechanical interpretation is on the same level of mathematical strictness as 
statistical mechanics. Thus, we will try to put the argument into a mathematically rigorous form in a 
future study. This effort might stimulate further unexpected developments in this line of research. 